U. S. Department of Transportation



Download 2.66 Mb.
Page25/35
Date02.02.2017
Size2.66 Mb.
#16216
1   ...   21   22   23   24   25   26   27   28   ...   35

Data Issues


This appendix contains the output from the “Issue Tracker” maintained as part of the data cleaning process.


ID

Dataset

Variable Name

Issue

Comments

Plan

1

AP

all

Missing Airport: CGI







2

AP

all

Missing Airport: ESN







3

AP

all

Missing Airport: FTG







4

AP

all

Missing Airport: GKY







5

AP

all

Missing Airport: GLS







6

AP

all

Missing Airport: GTR







7

AP

all

Missing Airport: GYH







8

AP

all

Missing Airport: HXD

only incident has no time associated

observation will be dropped

9

AP

all

Missing Airport: JST







10

AP

all

Missing Airport: MER







11

AP

all

Missing Airport: MYV







12

AP

all

Missing Airport: OMN

note, airport closed




13

AP

all

Missing Airport: SPG







14

AP

all

Missing Airport: UCA







15

AP

all

Missing Airport: VBG







16

AP

all

Missing Airport: VCV







17

AP

ALL

definitions of the variables

Many of the variables, while having a plain-text definition, do not indicate how they were gathered (e.g., What is a "short taxi"), over how long (e.g., average rainy days)

Request clarification from FAA;

18

AP

geo_bullseye_flag

definitions of the variables




More than 2 taxiways intersecting in the same area

19

AP

geo_mult_rwy_crossing_flag

definitions of the variables




Does pilot have to cross multiple rwys to get to a departure rwy

20

AP

geo_num_hotspot

definitions of the variables




This information is in the airport diagram charts (Marked as "HS 1", "HS 2", etc.)

21

AP

geo_num_rsa_isect

definitions of the variables




Use the standard definition for the Runway Safety Area to determine whether there is intersection (1000ft at approach and departure ends).

22

AP

geo_num_taxi_x_runway

definitions of the variables




In this case, count the number of taxiways that cross more than 2 runways

23

AP

geo_rwy_close_flag

definitions of the variables




Again, there is no "definitive" criteria in the CAST report. This applies to cases where the rwys are NOT parallel, but there is still a possibility for confusion (e.g., I'm assigned to land on rwy 35, but I land on rwy 4, b/c the rwy ends are very close. (E.g., see HUT hotspot #2). Suggested criteria: (a) rwy intersection angle less than 50 degrees, and (b) both rwy ends within 500 ft of the intersection point

24

AP

geo_rwy_crossing_flag

definitions of the variables




Does pilot have to cross a rwy to get to a departure rwy? Basically the answer is "No" if you have only 1 runway (or, if you have more than 1 rwy but NO twy crosses any rwy).

25

AP

geo_rwy_num_isect

definitions of the variables




use as is

26

AP

geo_rwy_num_t_isect

definitions of the variables




use as is

27

AP

geo_rwy_parallel_flag

definitions of the variables




are there parallel runways (these can be determined w/o an airport diagram by looking at the rwy numbers, eg., 26R / 26L)

28

AP

geo_taxi_short_flag

definitions of the variables




Could not get any information. Suggest taking a sample of all airports for which this item is flagged, then of the sample, use the greatest observed value as our value? F26

29

AP

lahso_flag_ap

definitions of the variables




use as is

30

AP

locid

definitions of the variables




use as is

31

AP

runway information

KWA

need to manually pull




32

AP

runway information

PFN

need to manually pull




33

AP

runway information

OMN

need to manually pull (NOTE: 5010 contains information for an airport that has taken over the code)




34

AP

traffic variables

One airport sums to 111%

operations percenages at Kalaeloa Airport (JRF) sum to 111%




35

AP

weather variables

combined weather stations

Many airports in a region seem to share weather data, even if they are not correct (e.g., Hyannis, MA and Lawrance, MA)




36

AP




15 airports not included in dataset that are in RI







37

OE

acft_evas_actn_code

27 missing values

Replaced missing with unknown

 

38

OE

acft_gnd_spd_kt_qty

1133 missing values

codebook (but not in data) contains ACFT_GND_SPD_UNKN_FLAG, is this "Y" for these 1133?

will treat missing as unkown

39

OE

acft_model_desc

odd naming conventions

used in forming aircraft groups. No longer needed in database. Keeping issue open to grouping is completed




40

OE

acft_obstn_code

Missing values and "0" coded values

What does missing mean? Also, the codebook has no definition of "0" but it appears 174 times. Note: of the zeroes, 18 have descriptions of obstructions

The valid obstructions themselves have little useful information, will treat as a "demographic" variable and move on

41

OE

acft_phase_code

17 missing values

12 missing do not have descriptions. The variable in general agrees with phaseofflight. Letting the 12 missing stay missing.

 

42

OE

acft_tcas_equip_code

21 missing values




variable dropped

43
















44

OE

ctlr_actn_contem_code

Incorrect coding

one value coded as "X", presumed to be a "Y"

variable dropped

45

OE

ctlr_actn_contem_code

missing

440 missing

variable dropped

46

OE

ctlr_actn_taken_code

missing

10 missing, same as ctlr_alert_code

variable dropped

47

OE

ctlr_alert_code

missing

6 missing do not have description. Demographic only variable, letting the 6 be missing

 

48

OE

ctlr_alert_otr_desc

missing

1336 missing, equal to those in ctlr_alert_code that are not "other"




49

OE

ctlr_area_spl_code

unclear meanings/codings

It's unclear what this variable is capturing in the first place, as sometimes this appears to list facilities (tower), locations (southwest), positions (LC1), areas (Area 7), or just a single number (6). There are 144 unique values in this field (though some are clearly the same with different abbreviations)

Data too inconsistent to use. Variable dropped

50

OE

ctlr_asst_req_flag

Y,N,Missing

27 Y, 12 missing, 1465 N

variable dropped

51

OE

ctlr_aware_dvlp_flag

Y,N,Missing

258 Y, 10 Missing, 1236 N

variable dropped

52

OE

ctlr_birth_date

Missing

479 Missing

199 missing in relevant data set, may have to exclude from some samples; recode one impossible value is missing, others over 61 are likely "grandfathered" in

53

OE

ctlr_certif_date

Missing

457 missing, a few wrong entires (7 years before birth, 2 years after birth). No entires prior to 1980, many certified 40+ years after birth

185 missing in relevant data set, significant followup needed

54

OE

ctlr_certif_type_ncode_oe

unclear meanings/codings

initial vs recerficiation?

"Controllers are initially certified. In pre-ATSAP days, certification could be revoked after an operational error (on one or all positions). Then recertification was required. Today, many events that might have required decertification are not (and instead handled via ATSAP process). " May have to take certifs "as is"

55

OE

ctlr_contrib_code

result of one-to-many merge

appears to have been a one-to-many merge from OE events to contributing factor codes, as a result, events appear in the db multiple times (i.e., once for each contributing code). We are not using the code in our model, so have dropped the field and removed duplicate lines




56

OE

ctlr_contrib_prev_30mo_qty

Missing

This is mutually exclusive with ctlr_prim_prev_30_mo_qty_oe. Treat missing as missing

 

57

OE

ctlr_curr_shft_end_time

Missing

456 missing

184 mising in relevant set, may have to exclude from some regressions

58

OE

ctlr_curr_shft_start_time

Missing

456 missing, same records missing as end time

184 mising in relevant set, may have to exclude from some regressions

59

OE

ctlr_dstrctn_flag

Y,N,Missing

200 Y, 15 Missing, 1289 N

variable dropped

60

OE

ctlr_fctr_med_certif_flag

Y,N,Missing

3 Y, 351 missing, 1150 N. may not be useful with so few Y

variable dropped

61

OE

ctlr_fpl_date

Missing

627 missing. The 4digit dates presented make no sense.

request clarification

62

OE

ctlr_perl_code

Missing

449 missing

181 missing in relevant data set, will have to exclude some

63

OE

ctlr_prev_shft_end_time

Missing

598 missing. 454 missing current end time. 2 missing current but have previous. 144 miss previous but have current

297 missing in relevant data set, will have to exclude those (plus "N/A" from some analysis)

64

OE

ctlr_prev_shft_start_time

Missing

598 missing, same missing as prev end time. 454 missing current start time. 2 missing current but have previous. 144 miss previous but have current

297 missing in relevant data set, will have to exclude those (plus "N/A" from some analysis)

65

OE

ctlr_prim_prev_30mo_qty

missing

528 missing

259 missing in relevant data set, need to understand relationship to contrib 30 month variable

66

OE

ctlr_psn_comb_desc

Missing

27 Missing. May require additional parsing to collapse in a usable categorical variable. Currently 642 unqiue values

will use the RI database instead, variable dropped

67

OE

ctlr_psn_fnctn_otr_desc

Missing

1464 Missing. Will need to parse better. 27 unique values

looks like people mistook this field for the "combined positions" description. Will drop variable


Download 2.66 Mb.

Share with your friends:
1   ...   21   22   23   24   25   26   27   28   ...   35




The database is protected by copyright ©ininet.org 2024
send message

    Main page