Skip to content

Barderus/UFO-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation


📊 Exploratory Data Analysis

  • Cleaned and standardized city, state, country, and time fields.
  • Explored frequency of shapes, durations, and reporting trends.
  • Chi-square test & Cramér’s V used to assess shape correlations by location.

🗺️ Geospatial Analysis

Using GeoPandas, sightings were plotted on a global basemap to explore density by region.

UFO World Map


🕒 Temporal Trends

Analyzed yearly, monthly, weekday, and hourly sighting patterns.

  • Most sightings occur in summer, especially July.
  • Reports spike between 8 PM and midnight.
  • Sightings peaked around 1997–2003.

Sightings Per Year


💬 Natural Language Processing (NLP)

Applied text cleaning, sentiment analysis, and word cloud generation on over 80,000 sighting descriptions.

  • Majority of reports are neutral or mildly positive.
  • Language tends to be objective, focusing on observation.
  • Word cloud of most common terms below:

UFO Word Cloud


📌 Tools Used

  • Python (Pandas, NumPy, Matplotlib, Seaborn, GeoPandas)
  • NLP: TextBlob, wordcloud, nltk
  • Geospatial: geopandas, shapely, geodatasets
  • Jupyter Notebooks

🚀 Possible Future Work

  • Topic modeling on sighting descriptions
  • Time-series forecasting or anomaly detection
  • Interactive dashboard (e.g., with Plotly Dash or Streamlit)

📄 License

MIT


🤝 Acknowledgments

Data from NUFORC via Kaggle.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors