Natural Language Processing (NLP)
Designing text pipelines, retrieval-augmented generation (RAG), classification models, and guardrails for reliable, production-ready applications.
Aspiring Data Scientist & Machine Learning Engineer passionate about turning data into measurable impact. Experienced in developing NLP pipelines, forecasting models, recommender systems, and deploying production-grade ML solutions with MLOps and AWS. Strong foundation in data engineering and visualization for clear, actionable insights.
Data Scientist & ML Engineer
I am a Software Engineering (AI) graduate and aspiring Data Scientist & Machine Learning Engineer, currently seeking opportunities to apply my skills in real-world projects. I am passionate about building end-to-end ML solutions — from data preparation and feature engineering to model training, deployment, and monitoring — with a focus on scalability, reliability, and measurable outcomes.
My expertise spans Python, SQL, C++, and PySpark for data science; TensorFlow, PyTorch, and XGBoost for modeling; and AWS, Docker, and CI/CD for cloud & MLOps. I also bring strengths in NLP, forecasting, recommender systems, and deep learning, alongside experience in visualization tools such as Tableau, Power BI, and Streamlit to deliver actionable insights.
My objective is to contribute to teams where I can solve business problems with data-driven solutions, improve decision-making through ML systems, and continue growing into a well-rounded Data Scientist & ML Engineer.
Sydney, NSW, Australia
Full-time in Australia
Designing text pipelines, retrieval-augmented generation (RAG), classification models, and guardrails for reliable, production-ready applications.
Building probabilistic and deep learning models (e.g., TFT) with disciplined feature engineering to improve demand planning and reduce error rates.
Developing collaborative and session-based models, leveraging feature stores and evaluation frameworks to boost engagement and CTR.
Deploying ML models with Docker, FastAPI, and CI/CD; monitoring performance in production and scaling with AWS (SageMaker, Glue, Redshift).
Building reliable pipelines with Pandas, PySpark, and dbt; ensuring clean, high-quality data for analytics and ML systems.
Turning data into decisions with Tableau, Power BI, and Streamlit; delivering insights through clear, impactful narratives.
End-to-end ML with a product mindset — depth in modeling, breadth across data & MLOps.
Ingested & analysed 23K+ e-commerce reviews, reducing manual feedback triage by 80% through automated sentiment analysis, topic modelling, and drift monitoring.
Unstructured reviews slowed insights; hidden pain-points and fit issues driving returns.
Built NLP pipelines (clean → sentiment → topic); visualised trends & co-occurrence; drift detection with monitoring KPIs.
−80% manual analysis, 7 customer themes surfaced, faster product decisions (+15–20%), ~10–12% reduction in returns, ~8–10% retention uplift.
Unified demand forecasting, inventory optimisation, and risk intelligence into an interactive Streamlit platform; reduced stockouts 15–30% and improved forecast reliability via P10/P50/P90 fan charts.
Stockouts & excess holding costs from unreliable forecasts and limited scenario planning.
Implemented diverse forecasting models; Monte Carlo (200–1000) for (s, S) policies; schema-validated Walmart data; event-driven S&OP risk KPIs.
−15–30% stockouts, 99%+ data integrity, faster executive decisions via auto-generated summaries, improved developer reproducibility with CLI tools & modular repo.
Collaborative filtering in Python/Flask; Top-5 accuracy 87%; lightweight API for serving.
Low attach rate; poor personalization.
Neighborhood CF + implicit feedback; simple service for A/B tests.
+CTR on related items; faster discovery of long-tail products.
Hybrid retrieval + cross-encoder re-rank; guardrails; FRT −41%. Faithfulness & grounding eval.
Slow, inconsistent answers across support docs.
BM25+dense (RRF), cross-encoder; PII redaction & refusal policies.
−41% first response time; higher CSAT on complex queries.
Churn rising; reporting slow & error-prone due to inconsistent data quality across sources.
Torrens University Australia
QIBA — Sydney • Jan 2024 – Dec 2024
Australian workplace-ready program focused on ICT practice, communication and industry placement.
Analytical thinking and applied ML with Python.
Open to DS/ML roles & collaborations. Sydney • AU work rights.
I am eager to apply my skills in Data Science and Machine Learning to real-world challenges. With a background in Software Engineering (AI) and hands-on experience across NLP, forecasting, recommender systems, and MLOps, I focus on building solutions that are reliable, scalable, and deliver measurable impact.
I am actively seeking opportunities to contribute as a Data Scientist / ML Engineer. If you are hiring or open to collaboration, I’d be glad to connect and discuss how my expertise can add value to your team or project.