This portfolio is a compilation of notebooks and projects I created for data analysis or for exploration of machine learning algorithms.
This project was contracted by Water Climate Trust, which was preparing for work at the 2018 and 2019 United Nations Climate Change Conferences (COP24 and COP25). The objective was to use the Clean Development Mechanism (CDM) project data available through the UN to uncover patterns and issues in the carbon market trading system. I did the initial research, gathered the raw data, and cleaned it to make it easier to use. I then analyzed the data and created visualizations.
Part 1 of the data cleaning can be found here.
Part 2 of the data cleaning and analysis can be found here.
This was a group project for a graduate data science course at UC Davis. We obtained three years of bike share usage data (2014, 2015, and 2016), including stations used, trip duration, and member status. We compared this data with weather data, neighborhood data (crime, income), and the proximity of bike share stations to Caltrain stations. Our goal was to gain insight into the new bike share program in order to inform future expansion efforts.
These are problems, projects, and assignments I worked on while taking graduate-level coursework.
See homework assignment with both conceptual and data analysis problems here.
See homework assignment with both conceptual and data analysis problems here.
See homework assignment with both conceptual and data analysis problems here.
See homework assignment with both conceptual and data analysis problems here.
See homework assignment with both conceptual and data analysis problems here.
This project involved scraping websites for data and gathering multiple data sources. We then chose our method of analysis and assessed its effectiveness. See the project here for more detail. Supporting notebooks are here and here.
These assignments showcase skills such as image processing, web scraping, data cleaning, working with geospatial data, and data analysis. Assignment 1, Assignment 2, Assignment 3, Assignment 4, Assignment 5, Assignment 6
Fitting a Model for Abalone Age
Logistic Regression Model to Predict Probability of House Sparrow Survival
Mathematical Statistics (2 courses)
Applied Statistics (2 courses)
Probability Theory
Machine Learning
Data Science
Computational Statistics
Topological Data Analysis
Computability, Complexity, and Algorithms
Machine Learning