Hengyuan (David) Liu

Hi! Welcome to Hengyuan(David)'s website! đź‘‹


About Me

About Me

I am pursuing a Master's in Applied Statistics and Data Science at the University of California, Los Angeles. I'm passionate about theory, inference, and applying Statistics to real-world problems. UCLA Statistics & Data Science Profile


Education

      University of California, Los Angeles
      Master's Degree, Applied Statistics and Data Science
      Expected: Dec. 2024

      University of California, Davis
      Bachelor's Degree, Statistics
      Jun. 2023


Work Experience

Clinical Programmer (Summer Intern) at Kite Pharma

JUN. 2024 - AUG. 2024

  • Performed data analysis and statistical modeling on clinical trial datasets using Python and SAS, enhancing data quality and integrity
  • Automated the replacement of placeholders in Word documents with content from RTF files using Python, pypandoc, and python-docx
  • Collaborated with clinical operations, data management, and biometrics to ensure accurate data handling and reporting
  • Implemented Python programs to facilitate data management workflows to streamline clinical operations, increasing trial efficiency

  • Reader (Part-Time) at UCLA

    JAN. 2024 - MAR. 2024

  • Facilitated learning and provided detailed feedback on weekly assignments in STATS 20 boosting students' statistical concepts, data analysis, and R programming techniques.
  • Collaborated with the teaching team to develop effective teaching strategies, ensuring a comprehensive learning experience

  • Coming Soon...

    Month. Year - Month. Year

  • Future work experience to be placed here.
  •  


    Skills

    R, Python, Excel, SQL, SAS, C++, AWS Database, LaTeX, HTML, CSS, Office 365


    Toolkit

    R Studio, Visual Studio, Jupyter Notebook, Tableau, SAS Studio, MySQL workbench, AWS Database Overleaf, Excel, Office 365


    Selected Works

    1. Analysis of Newborn Names from NYC

    MAY. 2024 - JUN. 2024

    Read more about this project (open in new window)

     

    2. Moment Generating Functions for Univariate and Multivariate Distributions and Their Use in Distribution Theory

    FEB. 2024 - MAR. 2024

    Read more about this project (open in new window)

     

    3. Analysis of Ten Minutes of Trauma Sence Time Presentation

    Nov. 2023 - Dec. 2023

    Read more about this project PowerPoint (open in new window)

    Read more about this project R code in HTML (open in new window)

     

    The Video Presentation of Analysis of Ten Minutes of Trauma Sence Time

     

    4. Model Selection and Validation Project

    Sep. 2023 - Oct. 2023

    Read more about this project (open in new window)

     

    Selected Works-UCD

    1. Multivariate Regression in Diabetes Data Analysis

    April. 2023 - May. 2023

    Read more about this project (open in new window)

     

    2. Analysis of Shot Marilyns by Andy Warhol

    Read more about this project (open in new window)

    Mar. 2023 - May. 2023

     

    3. Analysis of factors associated with Heart Disease / Stroke

    Read more about this project (open in new window)

    Feb. 2023 - Mar. 2023

     

    4. 2022 - 2023 H5N1 Bird Flu Modeling and Prediction in the United States

    Read more about this project (open in new window)

    Jan. 2023 - Mar. 2023

  • This is a project for the International Association of Statistical Computing (IASC) Data Analysis Competition 2023.
  • Worked with two students from UC Davis, Weilin Cheng, and Kathy Mo, and two students from UMich, Sida Tian and Li Yuan.
  • Built and compared statistical models that predict which county in the upcoming month will have H5N1 case(s).
  • The prediction accuracy of the model is 98.4% with AUC 0.8015.
  • Provided some suggestions including collecting more related data such as egg production and breeding size to improve the model.
  • Proposed some advice on how to control the spread of the H5N1 virus.
  • The team won 2nd place and a travel grant for attending the World Statistics Conference (WSC) 2023 organized by the International Statistical Institute.
  •  

    5. Movie Rating Predictions Based on Reviews

    Read more about this project (open in new window)

    Feb. 2023 - Mar. 2023

  • This is the final project for the Advanced Statistical Computing class at UC Davis.
  • Worked with four UC Davis students: Josh Balingit, Luc Chen, Kathy Mo, and Ka Wai Sit.
  • Created different machine learning models from the IMDB Kaggle dataset and compared them.
  • Evaluated which algorithms most effectively predict whether a movie receives a positive rating or not.
  •  

    6. California 2022 Proposition 30 Feasibility Report and Recommendations

    Read more about this project (open in new window)

    Oct. 2022 - Nov. 2022

  • This is a project for the CA 2022 Election Data Challenge hosted by UC Davis.
  • Worked with two students from UC Davis, Weilin Cheng and Henyuan Liu, and one student from UMich, Li Yuan.
  • Figured out the relationship between multiple factors and Greenhouse Gas emission by light-duty vehicles in California.
  • Found out the relationship between fire control funding and Greenhouse Gas emitted by wildfire.
  • Concluded that only performing Proposition 30 is not enough to reach California’s carbon emission goal.
  • Suggested focusing more on carbon emitted by trucks, cargo ships, industrial pollution, air conditioners, public transportation, planes, etc.
  • The team won the nomination prize.
  •  

    7. Top 1000 and Bottom 1000 Movie User Score and Gross Income Data Analysis

    Read more about this project (open in new window)

    Nov. 2022 - Dec. 2022

  • This is the final project for the Statistical Data Technologies class at UC Davis.
  •  

    8. Time Series Analysis of Annual Temperature Anomalies

    Read more about this project (open in new window)

    Nov. 2022 - Dec. 2022

  • This is the final project for the Applied Time Series Analysis class at UC Davis.
  • Worked with two students from UC Davis, Weilin Cheng and Hengyuan Liu.
  •  


    Honors and Awards

    Item Date
    Second Place of the International Association of Statistical Computing Data Analysis Competition (open in new window)       March 2023
    Dean’s Honor List for Fall Quarter 2022 at UC Davis       January 2023
    Honorable Mention CA Election 2022 Data Challenge       November 2022

    Certifications

    Item Date
    Introduction to Front-End Development       Aug. 24, 2023
    Tableau Essential Training Aug. 11, 2023
    Python Data Structures and Algorithms Aug. 4, 2023
    SQL for Data Analysis       Dec. 23, 2022
    Python for Time Series Data Analysis       Sep. 15, 2022

    Contact Information