Hi, I'm Kristy Natasha Yohanes

I am an AI/ML developer who loves to immerse myself in technology and hands-on development: experimenting with machine learning, geospatial analysis, coding challenges, cloud automation, web development, and Raspberry Pi projects to explore data science applications within geoscience and environmental fields. To unwind, I find joy in the occasional thrill of wall climbing and archery during my free time.Here's my contact info, CV, and portfolio of my qualifications. You can download the PDF version of my portfolio (with certifications) below:

# ABOUT ME

I hold a Bachelor's degree from Institute of Technology Bandung (ITB), Indonesia.

My expertise lies in Python programming, specializing in ML, NLP, CV, time-series analysis, and graph networks.

I am also proficient in SQL, BI tools, GIS, VM, GCF, Docker, and K8s.

Projects & Case Studies

Throughout my academic and professional journey, I've actively participated in projects focused on developing forecasting models and conducting weather data analysis utilizing machine learning techniques, notably the hybrid ANN-ARIMA for predicting monsoonal patterns.Additionally, I contributed to community service initiatives by integrating the Weather Research and Forecasting (WRF) computational model to develop advanced flood risk assessment tools and designed user-centric web applications to distribute early warning alerts, enhancing proactive disaster mitigation strategies.

Achievements & Publications

  • Top 10 Scientific Paper Physics Fair Padjadjaran University

  • Top 10 Scientific Paper Economic Finance Study Club Diponegoro University

PUBLISHED: Propagation Characteristics of Madden Julian Oscillation in the Indonesian Maritime Continent: Case Studies for 2020-2022, Agromet Journal, doi: 10.29244/j.agromet.38.1.1-12PREPRINT / UNDER REVIEW: An Elementary Approach to Predicting Indonesian Monsoon Index: Combining Ann-Arima Hybrid Method and Practical Use

GitHub: Mock Projects

One of my enjoyable side projects I'm proud of is the YouTube video recommendation insights project. I utilized the YouTube Data API and NLP to analyze users' watch histories, uncovering insights into video recommendations. Key features include OAuth 2.0 integration, transcript analysis, keyword extraction, visualization, and data export capabilities.

Another one is on Customer Goods Data Modeling project that addresses industrial challenges through predictive modeling for daily sales quantity and customer segmentation. It utilizes PostgreSQL and DBeaver for data ingestion, Tableau Public for interactive dashboards, Python in Google Colab for predictive modeling, including time series ARIMA, and clustering techniques.

Professional Certifications

  • AML and Data Governance: Risk-Based Mentoring Program for Crimes of Money Laundering and Terrorism Financing in Human Trafficking and Financial Technology Crimes - PPATK 2024

  • Data Science: Certificate of Competencies - Kalbe Nutritionals Data Scientist Project Based Internship Program 2023

  • Full-Stack Development: Certificate of Competencies - BTPN Syariah Fullstack Developer Project Based Internship Program

Coursework Certifications

  • Google Cloud Professional Machine Learning Engineer Cert Prep: 1-6 - Google Developers

  • Artificial Intelligence on Microsoft Azure - Microsoft

  • The Full Stack - Meta

  • SQL for Data Science - UC Davis

  • Python for Data Science, AI & Development - IBM

  • Intro to Data Analytics - RevoU

  • Data Programming - Sololearn

  • Data Visualization ShortClass - MySkill

  • Intensive Bootcamp: Data Analysis - MySkill x Deloitte

  • Power BI Essential Training - LinkedIn Learning

  • Companies and Climate Change - ESSEC Business School

  • The Science of The Solar System - Caltech

  • The Evolving Universe - Caltech

WORK EXPERIENCE

• Model development:
- Implemented PCA and Isolation Forest for anomaly detection in merchant-customer transactions.
- Implemented Random Forest and Network Analysis to map illegal online gambling transactions.
- Developed semi-supervised Relational Graph Convolution Network (RGCN) model for collusion risk assessment.
- Developed a gradient boosting model that utilizes the (RGCN) model for collusion risk assessment.
• ML deployment: Utilized GCP cloud automation tools including Cloud Functions and Virtual Machine for model deployment.
• BAU: Utilized device intelligence to monitor tampering, cyberattacks, account takeovers, and computer vision for eKYC verification.

Utilized PostgreSQL in DBeaver for data exploration, developed interactive Tableau dashboards, employed ARIMA time-series regression in Python to estimate daily product quantities for inventory management, and applied K-Means marketing strategies and providing personalized promotions based on customer segmentations.

Designed and deployed a Flask-based recommendation system API incorporating NLP techniques such as topic modeling, named entity recognition, and sentiment analysis, utilizing Docker for containerization and Kubernetes for scalable deployment, enhancing personalized job matching between seekers and listers.

• Machine Learning Researcher - EST Research Group ITB
• Full-Stack Developer Internship - BTPN Syariah
• Front-End Developer Internship - Core Initiative Studio
• Data Science Research Internship - LAPAN/Indonesian National Aeronautics & Space Administration
• GIS Data Analyst Internship - Garda Caah

Tech Stacks 
Programming Languages:Python, SQL, Go, Javascript, C++
Libraries/Tools:ML (SciKit Learn, Keras, XGBoost, TensorFlow, PyTorch, OpenCV), NLP (NLTK, HuggingFace), Visualization (Matplotlib, Seaborn, Pandas, NumPy), Graph Networks (NetworkX, RGCNconv, HGTconv)
Databases:PostgreSQL, BigQuery
Development:IDEs (VS Code, DBeaver), Version Control (Git), Backend Frameworks (Flask, Django, Gin)
Deployment:VM, Cloud Functions, Docker, Kubernetes