Yao L. 2024 | BASIS Independent Fremont
- Project Title: Exploratory Data Analysis using machine learning on water quality (Temperature, Dissolved Oxygen, Ph, ORP) using machine learning
- BASIS Independent Advisor: Mr. Andrew Magee
- Internship Location: UCSD, Engineering Dept.
- Onsite Mentor: Dr. J Garza
The goal of my senior project is to write a research paper by drawing predictions on various types of water quality data from multiple lakes. I will focus on temperature, dissolved oxygen, pH, ORP, and store them in a structured database while removing any inconsistencies or redundancy. I intend to use Exploratory Data Analysis to visualize trends and understand correlations between the different parameters. By combining Exploratory Data Analysis with machine learning algorithms, I will make predictions based on the following four assumptions: Linearity Residuals, Independence Assumption, Constant Variance Assumptions(Homoscedasticity), and Normality Assumption. In addition, I will use multiple graphs and different statistical tests to support and verify these assumptions. For example, I hope to interpret QQ-plot and Shapiro-Wilk test to check Homoscedasticity. From these assumptions, I aim to make predictions on enhancing water quality, thus creating positive impacts on the environment, preventing waterborne diseases and securing clean water for drinking and agriculture. The project will display as a research paper with all the proper procedures.