Education

  • Ph.D. in Computer Science, Northwestern University (Evanston, IL), Jul 2020 – May 2026 (expected).
  • M.S. in Analytics, Georgetown University (Washington, DC), Aug 2018 – May 2020.
  • B.S. in Biochemistry, University of Washington (Seattle, WA), Aug 2014 – Dec 2017.

Work Experience

  • Jun 2025 – Aug 2025: Data Scientist Intern, iSoftStone Inc
    • Designed a multimodal indoor localization framework integrating Wi-Fi RSSI fingerprinting with Fine Time Measurement (FTM) range estimation, using probabilistic sensor fusion (Bayesian filtering) alongside a graph neural network-based correction model to improve positioning accuracy.
    • Developed a data-driven, site-independent positioning model that leverages FTM’s resilience to multipath interference and utilized contrastive signal embedding to reduce reliance on dense fingerprint calibration, enabling robust cross-site generalization.
    • Built an experimental testbed and benchmarking suite for repeatable wireless positioning trials, enabling rigorous evaluation of indoor localization algorithms.
  • Jun 2024 – Aug 2024: Machine Learning Engineer Intern, AstraZeneca
    • Developed a domain-specific Retrieval-Augmented Generation (RAG) pipeline by integrating a FAISS-based dense retriever with a fine-tuned LLaMA-2 generator; created a custom Q&A dataset from internal medical documents via dual-encoder ranking and GPT-4 validation to fine-tune the system for contextually accurate, factually grounded answers.
    • Optimized persona-aware prompting by dynamically adjusting role/context embeddings (injecting learned “role vectors”) to bias the model’s internal activations.
    • Implemented hierarchical chunking strategies to maintain semantic coherence across long documents.
  • Aug 2018 – Sep 2019: Innovation Data Analyst, Office of Diversity & Inclusion
    • Optimized SQL queries for efficient demographic data collection and analysis, improving the tracking of medical student diversity and performance.
    • Conducted statistical analyses on intervention outcomes using t-tests and bootstrapping, providing actionable insights for program improvements.
    • Performed NLP-based sentiment analysis on survey responses to gauge participant feedback and identify sentiment trends.
    • Created interactive Tableau dashboards to visualize key metrics and presented findings to stakeholders, facilitating informed decision-making.
  • Mar 2018 – Aug 2018: Data Analyst Intern, iSoftStone Inc
    • Built and maintained Power BI dashboards to track Microsoft CELA team KPIs, enabling leadership to monitor performance indicators at a glance.
    • Developed ETL pipelines using Power Query and DAX to automate data integration from multiple sources, significantly reducing manual data processing time and effort.
    • Automated data collection and analysis tasks with Python and R, reducing manual effort and substantially increasing reporting efficiency.
    • Presented monthly analytics reports to senior stakeholders, translating data insights into strategic recommendations that informed high-level decision-making.

Technical Skills

  • Programming: Python, R, SQL, Java, C++, MATLAB, Arduino
  • Frameworks & ML Libraries: PyTorch, TensorFlow (Lite), DeepSpeed, HuggingFace, scikit-learn, Pandas, NumPy
  • Machine Learning: Large Language Models (OpenAI GPT-4, LangChain), Neural Architecture Search (NAS), TinyML, Computer Vision, Natural Language Processing
  • Data Tools: Power BI, Tableau, Azure ML Studio, Azure Cognitive Services, Hadoop/Spark
  • Statistical Analysis: Hypothesis Testing, Time Series Analysis, Experimental Design, Crowdsourcing Design