Yong Sook Prasit Attavit

Logo

DS/ BI Developer

View My GitHub Profile

Certified Tableau Desktop Specialist

Technical Skills: Tibco Spotfire, Tableau, PowerBI, SQL, SQL Server Management Studio, PostgreSQL, Python, Excel with VBA, JMP Pro, LaTeX

Soft Skills: Project Management, Leadership, Effective Communication & Presentation

Education & Relevant Certifications

Personal Projects

Loan Defaulter Machine Learning Classification [Python, scikit-learn, Applied Machine Learning]

GitHub

Business Case: Develop a Machine Learning model for the company to identify if the loan applicant is likely or unlikely to default on the loan, thereby assisting in the company’s portfolio & risk assessment capabilities

Exploratory Data Analysis [EDA], feature-engineering & model testing was conducted using Python within the JupyterNotebook environment. Various feature engineering techniques & models were experimented & the best AUC scoring model was then selected & applied to form the final model.

The final model, along with the feature-engineered dataframe, acheived a ~90% predictive AUC score after cross-validation via the Kaggle platform, thus demonstrating a strong ablity to discriminate between loan applicants whom are likely or unlikely to default on loans.

Loan Defaulter Machine Learning Classification Project

Shopee Code League: Logistics [Python]

GitHub

Background: Amidst the global COVID-19 pandemic, the surge in online orders on Shopee has heightened the importance of on-time delivery, a critical factor in e-commerce success. To meet this demand, Shopee has partnered with top-performing logistics providers across the region, enforcing SLAs and penalties to ensure timely deliveries. This rigorous monitoring and accountability process reinforces Shopee’s commitment to delivering goods punctually, enhancing user confidence.

Task: Identify all the orders that are considered late depending on the Service Level Agreements (SLA) with Shopee’s Logistics Provider.

Shopee Code League: Logistics

COVID-19_SQL_Tableau_Dashboard [T-SQL, SQL Server Management Studio, Tableau]

GitHub

Background: This project focuses on visualizing COVID-19 infection and deaths using data obtained from ‘Our World in Data’ with Tableau. It aims to provide a visual representation of the detrimental impact of COVID-19 on a global scale & within the SEA region.

COVID-19_SQL_Tableau_Dashboard_Global_Impact

COVID-19_SQL_Tableau_Dashboard_Global_Impact

Hotel Revenue Analysis [PowerBI, PowerQuery M, T-SQL, SQL Server Management Studio, Excel, PowerPoint]

GitHub

Business Case: A business manager of a chain of hotel would like to request help from the data analytics team to develop a dashboard to analyze & visualize hotel booking data

Requirement: Build a Dashboard using PowerBI to provide at-a-glance information about features that are relevant to revenue

Data from raw .csv file was ingested into SQL Server Management Studio [SSMS], from which SQL queries were iteratively built upon to obtain the final output which was then imported into SSMS for data analysis & visualization. Additional data processing was done in PowerBI with PowerQuery M for visualization purposes.

Guests, revenue seasonality, recommendation for staffing arrangement, and dataset limitations were explained with the help of additional visualizations done in PowerBI in the accompanying Powerpoint slides.

Hotel Revenue Analysis

Analysis of Departmental Salary Disparities [Tableau, Python, T-SQL, SQL Server Management Studio, Excel, Powerpoint]

GitHub

Business Case: The data analytics manager of a company would like to seek insights into salary disparities present within the company department

Objective:

Deliverables:

Data from raw .csv file was ingested into SQL Server Management Studio [SSMS], from which SQL queries were iteratively built upon to obtain the final output which included the departmental:

  1. Standard Deviation
  2. Average Salary
  3. Coefficient Of Variation
  4. Outlier Count based off Z-Score values

The top 5 departments (along with reasons for selection) were flagged out for review, based on the calculated output. Further elaboration is done in the accompanying JupyterNotebook and Powerpoint Slides.

Analysis of Departmental Salary Disparities

Analysis of Credit Card Churning Customers [Tableau, Powerpoint]

GitHub

Analysis of attritted bank customers were performed based on various qualitative & quantitative measures using Tableau for data visualization.

Relevant recommendations that the bank can undertake to alleviate churning customers were proposed. Limitations of the dataset which may lead to analytics bias was also discussed. Further elaboration is done in the accompanying Powerpoint Slides.

Analysis of Credit Card Churning Customers with Tableau

Numerical Investigation of Pressure Drop at Turbulent Flow Conditions in Single Pellet String Reactors [Computational Fluid Dynamics, LaTeX, OpenFOAM®, ParaView, Linux, C++]

GitHub Repo

Achieved distinction for thesis.

Application: C++ Programming was used in conjunction with OpenFOAM® in a Linux Distribution [Ubuntu] where modeling, mesh generation, and turbulent fluid simulations was conducted.

ParaView is used as both the GUI and post-processing tool to extract pressure drop, along with other parameters of interest. LaTeX was used for relevant graph plotting and the aggregation of all findings into a presentable format.

Numerical Investigation of Pressure Drop at Turbulent Flow Conditions in Single Pellet String Reactors

Work Experience

Data Scientist @ NUHS (Nov 2023 - Present)

Process & Equipment Engineer II @ Micron Semiconductor Operations Asia (Dec 2022 - Feb 2023)

Process & Equipment Engineer I @ Micron Semiconductor Operations Asia (Dec 2021 - Dec 2022)

Manufacturing Engineer @ Systems on Silicon Manufacturing Company Pte Ltd (SSMC) (Jan 2021 – Jul 2021)

Courses