Journalism and SEASR

  1. Title of Study/Project

  2. List of Team Members and Their Affiliation

    • Brant Houston
    • Jennifer LaFleur
    • Jaimi Dowdell
    • Chase Davis
    • David Donald
  3. SEASR Staff Contact

  4. Procedural Outline of Study/Project

    1. Research Question/Purpose of Study

    2. Data Source

      1. lists.def

        1. Gazetteers
          • People
            1. Aviation Medical Examiner
            2. Cabin Attendant
            3. Captain
            4. Certified Flight Instructor
            5. Civil Air Patrol
            6. Company Ground Personnel
            7. Engineer
            8. Federal Air Marshal
            9. First Officer
            10. Flight Attendant
            11. Flight Crew
            12. Flight Director
            13. Flight Engineer
            14. Instructor
            15. International Relief Officer
            16. Manager
            17. Mechanic
            18. Passenger
            19. Pilot
            20. Pilot Flying
            21. Pilot Not Flying
            22. Second Officer
            23. Supervisor
          • Measurements
            1. Celsius
            2. Decibel
            3. Degree
            4. Fahrenheit
            5. Feet Per Minute
            6. Feet/Foot
            7. Gigahertz
            8. Ground Speed
            9. Height
            10. Hertz
            11. Kilogram
            12. Kilohertz
            13. Kilometers
            14. Kilowatt
            15. Knot(S)
            16. Megahertz
            17. Mile
            18. Minute(S)
            19. Outside Air Temperature
            20. Pound
            21. Revolutions Per Minute
            22. Sea Level
            23. Speed
            24. Statute Mile
            25. Takeoff Gross Weight
            26. Temperature
            27. True Air Speed
            28. Weight
          • Direction
            1. North
            2. Northbound
            3. South
            4. Southbound
            5. East
            6. Eastbound
            7. West
            8. Westbound
          • Weather
            1. Crosswind
            2. Precipitation
            3. Overcast
            4. Thunderstorm
            5. Turbulence
            6. Visibility
            7. Weather
          • Plane_Parts
            1. Air Data Computer
            2. Aircraft
            3. Attitude Direction Indicator
            4. Auxiliary Power Unit
            5. Carburetor
            6. Cathode Ray Tube
            7. Circuit Breaker
            8. Control Display Unit
            9. Differential Global Positioning System
            10. Digital Flight Data Recorder
            11. Digital Flight Guidance System
            12. Distance Measuring Equipment
            13. Electronic Flight Instrument System
            14. Engine
            15. Flight Control Unit
            16. Head Up Display
            17. Power Control Unit
            18. Power
            19. Propeller
            20. Radar Approach Control
            21. Radio Direction Magnetic Indicator
            22. Radio Magnetic Indicator
            23. Stabilizer
          • Takeoff
            1. Takeoff
            2. Top Of Climb
            3. Pre Departure Clearance
            4. Estimated Time Of Departure
            5. Departure
          • Landing
            1. Landing
            2. Descent
            3. Descend
            4. Top Of Descent
            5. Estimated Time Of Arrival
            6. Inbound
            7. Touchdown Zone
            8. Approach
            9. Arrival
            10. Arrive
          • Incidents
            1. Near Midair Collision
            2. Rejected Takeoff
            3. Missed Approach Point
            4. Landing Without Clrnc
            5. Foreign Object Damage
            6. Distraction
          • Plane_SW_Systems
            1. Automatic Communications Addressing & Reporting System
            2. Automatic Landing System
            3. Automatic Weather Observing/Reporting System
            4. Autopilot and Flight Director System
            5. Aviation Safety Reporting System
            6. Digital Flight Guidance System
            7. Electronic Flight Instrument System
            8. Flight Management Guidance System
            9. Flight Management System
            10. Global Navigation System
            11. Global Positioning System
            12. Ground Proximity Warning System
            13. Instrument Landing System
            14. Inertial Navigation System
            15. Inertial Reference System/Unit
            16. Traffic Alert and Collision Avoidance System
    3. Analysis Tools

  5. Activity Timeline or Milestones

  6. Report or Project Outcome(s)

  7. Ideas on what your team needs from SEASR staff to help you achieve your goal.

  8. Team Communication Plans
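
As an illustration of how the gazetteer categories listed under Data Source might be applied to report text, here is a minimal sketch in Python. It is not the SEASR or Meandre implementation: the `GAZETTEERS` dictionary holds only a small hand-copied subset of the lists, and the names `build_pattern` and `tag_entities` are hypothetical helpers introduced for this example. It assumes simple case-insensitive phrase matching is acceptable.

```python
import re

# Hypothetical subset of the gazetteer categories from lists.def;
# the real lists are much longer.
GAZETTEERS = {
    "People": ["Captain", "First Officer", "Flight Attendant", "Pilot", "Pilot Flying"],
    "Weather": ["Crosswind", "Thunderstorm", "Turbulence"],
    "Incidents": ["Near Midair Collision", "Rejected Takeoff"],
}


def build_pattern(terms):
    """Compile one case-insensitive regex that matches any term as a whole phrase."""
    # Try longer terms first so "Pilot Flying" is preferred over "Pilot".
    ordered = sorted(terms, key=len, reverse=True)
    alternation = "|".join(re.escape(term) for term in ordered)
    return re.compile(r"\b(?:" + alternation + r")\b", re.IGNORECASE)


PATTERNS = {category: build_pattern(terms) for category, terms in GAZETTEERS.items()}


def tag_entities(text):
    """Return (category, matched text) pairs in order of appearance in the text."""
    hits = []
    for category, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            hits.append((match.start(), category, match.group(0)))
    hits.sort()
    return [(category, surface) for _, category, surface in hits]
```

Calling `tag_entities` on a narrative sentence returns the matched phrases with their categories, which could then be counted or filtered per report.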

  1. Mar 08, 2009

    Chase has gotten quite a bit of SEC text from its administrative proceedings, and we need to know whether you want him to just send all the PDF files and let Xavier look at them and plan whether to parse them or put them in a database.

    Please send thoughts and instructions.

  2. Jun 24, 2009

    Everyone has gotten through conferences and many projects. I am going through the list of follow-up items.

    I believe David Donald is waiting for a dataset from Bernie so that David can use a full search query of that dataset for key words.

  3. Jun 26, 2009

    From our wish list:

    We still would like to be able to run PDFs thru the flows without converting to txt.

    Once we get the file from Bernie, we need to run analytics to come up with patterns to search for in full data sets. Or can we do that already on the sample?

    Other entities we'd like to extract: date, money, person, organization, time, percentage...

  4. Jun 26, 2009

    Yes. My notes from May say that Bernie would help with getting the SEC data scraped from its website. In the meantime, as a first step in exploring our text, I would follow up with full-text indexing so we can narrow and focus what we're hunting for with Meandre's help. Since full-text indexing was new to me in May, I've brushed up on that skill and am now ready to proceed. I will post here what I learn, as promised to Bernie, when we can start with the SEC data. Again, from our conversation with Bernie, he prefers that it be done in MySQL, so I will do it in MySQL.

    1. Jul 01, 2009

      I have created a dump of the data that was imported into the instance of MySQL which we had prepared for our workshop session. I will e-mail this file (gzip-compressed, 7.5M). This file should give you (1) the SEC download we acquired while you were here, in the table SEC; (2) sampledata, which has the content supplied previously (larger textual content); (3) sampledata_replaced, which has some of the encoding anomalies corrected; and (4) the bb_msg table, which has very short comment-style entries.

  5. Jul 27, 2009

    This is a follow-up from the call with Brant, Bernie and me on July 9, before my two-week vacation. We have four areas we're concentrating on next.

    First, Brant and David would check SEC policy regarding a web scrape or crawl of its enforcement texts at http://www.sec.gov/divisions/enforce.shtml. After scouring the site, I could find no policy. Brant called the SEC press office and learned that they have no specific policy in this area because it's public data and can be accessed and acquired however public users want.

    Second, Bernie will do a web scrape or crawl of the SEC enforcement texts, concentrating on Federal Court and administrative actions. When completed, this will give us the core texts for the next steps.

    Third, when Bernie is finished, he will notify us how to get these texts. I will import them into MySQL and set up a full-text index. I will make this available to the whole team so that we can next run free-text queries to narrow down to the SEC cases/texts we're most interested in, based on our keywords and concepts.

    Fourth, Bernie will point us to a repository of components so that we may begin to build on the basic set in the default Meandre and move toward the different techniques and goals we set with Loretta in May.

    1. Jul 27, 2009

      Thanks David. Wanted to be even more specific about the SEC conversation. The SEC public affairs office said it "encourages" citizens to download and use the data however they need to. If there are any questions on this, please don't hesitate to talk with me or email me.

      Also, it appears to me that it would keep everyone focused and active if we set up a schedule on when to accomplish certain activities. It is July 27th and things are only going to get busier on the journalism front as August and September approach.

      Loretta, is the schedule something you and I should do?

      Thanks

      Brant
