Hi, I'm Kristy Natasha Yohanes
I am an AI/ML developer who loves to immerse myself in technology and hands-on development: experimenting with machine learning, geospatial analysis, coding challenges, cloud automation, web development, and Raspberry Pi projects to explore data science applications within geoscience and environmental fields. To unwind, I find joy in the occasional thrill of wall climbing and archery during my free time.Here's my contact info, CV, and portfolio of my qualifications. You can download the PDF version of my portfolio (with certifications) below:
# ABOUT ME
I hold a Bachelor's degree from Institute of Technology Bandung (ITB), Indonesia.
My expertise lies in Python programming, specializing in ML, NLP, CV, time-series analysis, and graph networks.
I am also proficient in SQL, BI tools, GIS, VM, GCF, Docker, and K8s.
Projects & Case Studies
Throughout my academic and professional journey, I've actively participated in projects focused on developing forecasting models and conducting weather data analysis utilizing machine learning techniques, notably the hybrid ANN-ARIMA for predicting monsoonal patterns.Additionally, I contributed to community service initiatives by integrating the Weather Research and Forecasting (WRF) computational model to develop advanced flood risk assessment tools and designed user-centric web applications to distribute early warning alerts, enhancing proactive disaster mitigation strategies.
Achievements & Publications
Top 10 Scientific Paper Physics Fair Padjadjaran University
Top 10 Scientific Paper Economic Finance Study Club Diponegoro University
PUBLISHED: Propagation Characteristics of Madden Julian Oscillation in the Indonesian Maritime Continent: Case Studies for 2020-2022, Agromet Journal, doi: 10.29244/j.agromet.38.1.1-12PREPRINT / UNDER REVIEW: An Elementary Approach to Predicting Indonesian Monsoon Index: Combining Ann-Arima Hybrid Method and Practical Use
GitHub: Mock Projects
One of my enjoyable side projects I'm proud of is the YouTube video recommendation insights project. I utilized the YouTube Data API and NLP to analyze users' watch histories, uncovering insights into video recommendations. Key features include OAuth 2.0 integration, transcript analysis, keyword extraction, visualization, and data export capabilities.
Another one is on Customer Goods Data Modeling project that addresses industrial challenges through predictive modeling for daily sales quantity and customer segmentation. It utilizes PostgreSQL and DBeaver for data ingestion, Tableau Public for interactive dashboards, Python in Google Colab for predictive modeling, including time series ARIMA, and clustering techniques.
Professional Certifications
AML and Data Governance: Risk-Based Mentoring Program for Crimes of Money Laundering and Terrorism Financing in Human Trafficking and Financial Technology Crimes - PPATK 2024
Data Science: Certificate of Competencies - Kalbe Nutritionals Data Scientist Project Based Internship Program 2023
Full-Stack Development: Certificate of Competencies - BTPN Syariah Fullstack Developer Project Based Internship Program
Coursework Certifications
Google Cloud Professional Machine Learning Engineer Cert Prep: 1-6 - Google Developers
Artificial Intelligence on Microsoft Azure - Microsoft
The Full Stack - Meta
SQL for Data Science - UC Davis
Python for Data Science, AI & Development - IBM
Intro to Data Analytics - RevoU
Data Programming - Sololearn
Data Visualization ShortClass - MySkill
Intensive Bootcamp: Data Analysis - MySkill x Deloitte
Power BI Essential Training - LinkedIn Learning
Companies and Climate Change - ESSEC Business School
The Science of The Solar System - Caltech
The Evolving Universe - Caltech
WORK EXPERIENCE
• Model development:
- Implemented PCA and Isolation Forest for anomaly detection in merchant-customer transactions.
- Implemented Random Forest and Network Analysis to map illegal online gambling transactions.
- Developed semi-supervised Relational Graph Convolution Network (RGCN) model for collusion risk assessment.
- Developed a gradient boosting model that utilizes the (RGCN) model for collusion risk assessment.
• ML deployment: Utilized GCP cloud automation tools including Cloud Functions and Virtual Machine for model deployment.
• BAU: Utilized device intelligence to monitor tampering, cyberattacks, account takeovers, and computer vision for eKYC verification.
Utilized PostgreSQL in DBeaver for data exploration, developed interactive Tableau dashboards, employed ARIMA time-series regression in Python to estimate daily product quantities for inventory management, and applied K-Means marketing strategies and providing personalized promotions based on customer segmentations.
Designed and deployed a Flask-based recommendation system API incorporating NLP techniques such as topic modeling, named entity recognition, and sentiment analysis, utilizing Docker for containerization and Kubernetes for scalable deployment, enhancing personalized job matching between seekers and listers.
• Machine Learning Researcher - EST Research Group ITB
• Full-Stack Developer Internship - BTPN Syariah
• Front-End Developer Internship - Core Initiative Studio
• Data Science Research Internship - LAPAN/Indonesian National Aeronautics & Space Administration
• GIS Data Analyst Internship - Garda Caah
Tech Stacks | |
---|---|
Programming Languages: | Python, SQL, Go, Javascript, C++ |
Libraries/Tools: | ML (SciKit Learn, Keras, XGBoost, TensorFlow, PyTorch, OpenCV), NLP (NLTK, HuggingFace), Visualization (Matplotlib, Seaborn, Pandas, NumPy), Graph Networks (NetworkX, RGCNconv, HGTconv) |
Databases: | PostgreSQL, BigQuery |
Development: | IDEs (VS Code, DBeaver), Version Control (Git), Backend Frameworks (Flask, Django, Gin) |
Deployment: | VM, Cloud Functions, Docker, Kubernetes |