Title of Study/Project
List of Team Members and Their Affiliation
- Brant Houston
- Jennifer LaFleur
- Jaimi Dowdell
- Chase Davis
- David Donald
SEASR Staff Contact
Procedural Outline of Study/Project
Research Question/Purpose of Study
Data Source
lists.def
- Gazetteers
  - People
    - Aviation Medical Examiner
    - Cabin Attendant
    - Captain
    - Certified Flight Instructor
    - Civil Air Patrol
    - Company Ground Personnel
    - Engineer
    - Federal Air Marshal
    - First Officer
    - Flight Attendant
    - Flight Crew
    - Flight Director
    - Flight Engineer
    - Instructor
    - International Relief Officer
    - Manager
    - Mechanic
    - Passenger
    - Pilot
    - Pilot Flying
    - Pilot Not Flying
    - Second Officer
    - Supervisor
  - Measurements
    - Celsius
    - Decibel
    - Degree
    - Fahrenheit
    - Feet Per Minute
    - Feet/Foot
    - Gigahertz
    - Ground Speed
    - Height
    - Hertz
    - Kilogram
    - Kilohertz
    - Kilometers
    - Kilowatt
    - Knot(S)
    - Megahertz
    - Mile
    - Minute(S)
    - Outside Air Temperature
    - Pound
    - Revolutions Per Minute
    - Sea Level
    - Speed
    - Statute Mile
    - Takeoff Gross Weight
    - Temperature
    - True Air Speed
    - Weight
  - Direction
    - North
    - Northbound
    - South
    - Southbound
    - East
    - Eastbound
    - West
    - Westbound
  - Weather
    - Crosswind
    - Precipitation
    - Overcast
    - Thunderstorm
    - Turbulence
    - Visibility
    - Weather
  - Plane_Parts
    - Air Data Computer
    - Aircraft
    - Attitude Direction Indicator
    - Auxiliary Power Unit
    - Carburetor
    - Cathode Ray Tube
    - Circuit Breaker
    - Control Display Unit
    - Differential Global Positioning System
    - Digital Flight Data Recorder
    - Digital Flight Guidance System
    - Distance Measuring Equipment
    - Electronic Flight Instrument System
    - Engine
    - Flight Control Unit
    - Head Up Display
    - Power Control Unit
    - Power
    - Propeller
    - Radar Approach Control
    - Radio Direction Magnetic Indicator
    - Radio Magnetic Indicator
    - Stabilizer
  - Takeoff
    - Takeoff
    - Top Of Climb
    - Pre Departure Clearance
    - Estimated Time Of Departure
    - Departure
  - Landing
    - Landing
    - Descent
    - Descend
    - Top Of Descent
    - Estimated Time Of Arrival
    - Inbound
    - Touchdown Zone
    - Approach
    - Arrival
    - Arrive
  - Incidents
    - Near Midair Collision
    - Rejected Takeoff
    - Missed Approach Point
    - Landing Without Clrnc
    - Foreign Object Damage
    - Distraction
  - Plane_SW_Systems
    - Automatic Communications Addressing & Reporting System
    - Automatic Landing System
    - Automatic Weather Observing/Reporting System
    - Autopilot and Flight Director System
    - Aviation Safety Reporting System
    - Digital Flight Guidance System
    - Electronic Flight Instrument System
    - Flight Management Guidance System
    - Flight Management System
    - Global Navigation System
    - Global Positioning System
    - Ground Proximity Warning System
    - Instrument Landing System
    - Inertial Navigation System
    - Inertial Reference System/Unit
    - Traffic Alert and Collision Avoidance System
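As a rough illustration of how category lists like the ones above could be used to tag narrative text, here is a minimal Python sketch. It is not the SEASR/Meandre component and does not read the actual lists.def syntax, which is not shown on this page; the `GAZETTEER` dictionary, the function names, and the sample sentence are all assumptions made for the example, and only a handful of the terms are included.

```python
import re

# Hypothetical in-memory form of a few gazetteer categories. The real
# lists.def file format is not shown here, so this layout is an assumption.
GAZETTEER = {
    "People": ["Captain", "First Officer", "Flight Attendant", "Pilot"],
    "Weather": ["Crosswind", "Thunderstorm", "Turbulence", "Visibility"],
    "Incidents": ["Near Midair Collision", "Rejected Takeoff"],
}


def build_matchers(gazetteer):
    """Compile one case-insensitive, whole-word pattern per category."""
    matchers = {}
    for category, terms in gazetteer.items():
        # Try longer terms first so multi-word entries win over shorter ones.
        ordered = sorted(terms, key=len, reverse=True)
        alternation = "|".join(re.escape(term) for term in ordered)
        matchers[category] = re.compile(
            r"\b(?:" + alternation + r")\b", re.IGNORECASE
        )
    return matchers


def tag_text(text, matchers):
    """Return (category, matched_text, start_offset) tuples sorted by offset."""
    hits = []
    for category, pattern in matchers.items():
        for match in pattern.finditer(text):
            hits.append((category, match.group(0), match.start()))
    return sorted(hits, key=lambda hit: hit[2])


if __name__ == "__main__":
    sample = "The first officer reported turbulence before the rejected takeoff."
    for hit in tag_text(sample, build_matchers(GAZETTEER)):
        print(hit)
```

Matching is plain string lookup, so variants such as plurals or abbreviations would need their own entries in the lists.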
7 Comments
Mar 08, 2009
Joseph Brantley Houston
Chase has gotten quite a bit of SEC text from its administrative proceedings, and we need to know whether you want him to just send all the PDF files and let Xavier look at them and plan how to parse them, or put them in a database.
Please send thoughts and instructions.
Jun 24, 2009
Joseph Brantley Houston
Everyone has gotten through conferences and many projects. I am going through the list of follow-up items.
I believe David Donald is waiting for a dataset from Bernie so that David can use a full search query of that dataset for key words.
Jun 26, 2009
Jennifer LaFleur
From our wish list:
We still would like to be able to run PDFs through the flows without converting to txt.
Once we get the file from Bernie, we need to run analytics to come up with patterns to search for in full data sets. Or can we already do that on the sample?
Other entities we'd like to extract: date, money, person, organization, time, percentage...
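To make the wish list above a little more concrete, here is a minimal Python sketch of pattern-based extraction for four of the entity types (date, money, time, percentage). It is a plain regular-expression stand-in, not an existing Meandre component; the patterns, the `extract_entities` name, and the sample sentence are assumptions for illustration. Person and organization are left out because they generally need a trained named-entity recognizer or name lists rather than fixed patterns.

```python
import re

# Illustrative patterns only; real filings will need broader coverage
# (other date formats, currencies, spelled-out numbers, and so on).
PATTERNS = {
    "date": re.compile(
        r"\b(?:Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec)[a-z]*\.? "
        r"\d{1,2}, \d{4}\b"
    ),
    "money": re.compile(
        r"\$\d+(?:,\d{3})*(?:\.\d+)?(?: (?:thousand|million|billion))?"
    ),
    "percentage": re.compile(r"\b\d+(?:\.\d+)?(?:%| percent\b)"),
    "time": re.compile(r"\b\d{1,2}:\d{2}(?: ?[ap]m)?\b", re.IGNORECASE),
}


def extract_entities(text):
    """Return a dict mapping each entity type to the list of matches found."""
    return {
        kind: [match.group(0) for match in pattern.finditer(text)]
        for kind, pattern in PATTERNS.items()
    }


if __name__ == "__main__":
    sample = (
        "On March 8, 2009, at 4:30 pm, the respondent agreed to pay "
        "$1,250,000, or 12.5% of the proceeds."
    )
    for kind, values in extract_entities(sample).items():
        print(kind, values)
```

Running patterns like these on the sample data first would show which formats actually occur before they are applied to the full data sets.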
Jun 26, 2009
David Donald
Yes. My notes from May say that Bernie would help with getting the SEC data scraped from its website. In the meantime, as a first step in exploring our text, I would follow up with full-text indexing so we could narrow and focus what we're hunting for with Meandre's help. Since full-text indexing was new to me in May, I've brushed up on that skill and am now ready to proceed. I will post here what I learn, as promised to Bernie, when we can start with the SEC data. Again, from our conversation with Bernie, he prefers that it be done in MySQL, so I will do it in MySQL.
Jul 01, 2009
Bernie Ács
I have created a dump of the data that was imported into the MySQL instance we had prepared for our workshop session. I will e-mail this file (gzip-compressed, 7.5M). This file should give you (1) the SEC download we acquired while you were here, in the table SEC; (2) sampledata, which has the content as supplied previously (larger textual content); (3) sampledata_replaced, which has some of the encoding anomalies corrected; and (4) the bb_msg table, which has very short comment-style entries.
Jul 27, 2009
David Donald
This is a followup from the call with Brant, Bernie and myself on July 9 before my two-week vacation. We have four areas we're concentrating on next.
First, Brant and David would check SEC policy regarding a web scrape or crawl of its enforcement texts at http://www.sec.gov/divisions/enforce.shtml. After scouring the site, I could find no policy. Brant called the SEC press office, learning that they have no specific policy in this area because it's public data and can be accessed and acquired however public users want.
Second, Bernie will do a web scrape or crawl of the SEC enforcement texts, concentrating on Federal Court and administrative actions. When completed, this will give us the core texts for the next steps.
Third, when Bernie is finished, he will notify us how to get these texts. I will import them into MySQL and set up a full-text index. I will make this available to the whole team so that we can next do free-text queries to narrow down the SEC cases/texts we're most interested in, based on our keywords and concepts.
Fourth, Bernie will point us to a repository of components so that we may begin to build on the basic set in the default Meandre and move toward the different techniques and goals we set with Loretta in May.
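For anyone on the team who has not used MySQL full-text search, here is a minimal Python sketch of the statements the third step above would involve. It only builds SQL strings and does not connect to a database. The table name SEC comes from the dump described earlier; the column names `id` and `body`, the index name `ft_body`, and the example keywords are hypothetical placeholders, not the actual schema or our agreed search terms. As far as I know, FULLTEXT indexes in current MySQL releases require MyISAM tables, so treat the storage engine as something to check.

```python
# "SEC" is the table named in the dump; "id", "body" and "ft_body"
# are hypothetical names used only for this illustration.
TABLE = "SEC"
TEXT_COLUMN = "body"


def fulltext_index_sql(table, column, index_name="ft_body"):
    """DDL that adds a MySQL FULLTEXT index on one text column."""
    return (
        f"ALTER TABLE `{table}` ADD FULLTEXT INDEX `{index_name}` (`{column}`)"
    )


def boolean_query(required=(), optional=(), excluded=()):
    """Build a boolean-mode search string: +must, may, -must not."""

    def fmt(term):
        # Multi-word terms are quoted so they match as a phrase.
        return f'"{term}"' if " " in term else term

    parts = [f"+{fmt(term)}" for term in required]
    parts += [fmt(term) for term in optional]
    parts += [f"-{fmt(term)}" for term in excluded]
    return " ".join(parts)


def search_sql(table, column):
    """Parameterized SELECT; %s is the placeholder style of common MySQL drivers."""
    return (
        f"SELECT id, MATCH(`{column}`) AGAINST (%s IN BOOLEAN MODE) AS score "
        f"FROM `{table}` "
        f"WHERE MATCH(`{column}`) AGAINST (%s IN BOOLEAN MODE) "
        "ORDER BY score DESC"
    )


if __name__ == "__main__":
    print(fulltext_index_sql(TABLE, TEXT_COLUMN))
    # Hypothetical keywords, chosen only to show the syntax.
    query = boolean_query(
        required=["insider trading"], optional=["fraud"], excluded=["dismissed"]
    )
    print(query)
    print(search_sql(TABLE, TEXT_COLUMN))
```

The search string would be passed twice as a query parameter (once for the score, once for the filter), which keeps user-supplied keywords out of the SQL text itself.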
Jul 27, 2009
Joseph Brantley Houston
Thanks David. Wanted to be even more specific about the SEC conversation. The SEC public affairs office said it "encourages" citizens to download and use the data however they need to. If there are any questions on this, please don't hesitate to talk with me or email me.
Also, it appears to me that it would keep everyone focused and active if we set up a schedule on when to accomplish certain activities. It is July 27th and things are only going to get busier on the journalism front as August and September approach.
Loretta, is the schedule something you and I should do?
Thanks
Brant