#PROJECT
CATEGORYSTATUS
01
Film Festival Tracker
Track documentary and experimental film festivals worldwide. Deadlines, fees, and acceptance rates, filterable by type and region.
view project →
Film
live
02
Analog Film Stock DatabaseCompare analog film stocks by price, availability, and format. Paired with a development lab directory searchable by process and location.view project →
Film
live
03
NYC Film Scene MapInteractive map of NYC screening spaces, workshops, labs, galleries, and community film spots. Filterable by category and borough.view project →
Film
live
04
AuteursPersonal index of filmmakers and performers, searchable by era, nationality, and style. Includes film guides and portraits.view project →
Film
live
05
NYC Food Help MapPublic-service directory of free food assistance organizations across NYC: pantries, soup kitchens, community meals, mobile food, and home delivery. Filterable, mapped, and volunteer-linked.view project →
Civic
live
06
NYC Adventure LogCurated NYC activities across seasons and neighborhoods. Cinema, Food, Arts, Explore, and more. Surprise picker included.view project →
City
live
07
Only Good ThingsCurated self-care activity tracker with filters for season, duration, mood, and setting. Tracks completions locally.view project →
Wellness
live
08
Cơm NhàVietnamese recipes from Miền Bắc, Miền Trung, and Miền Nam. Browse by region, search by ingredient.view project →
Food
live
09
Departure LoungeDestinations as boarding passes. Cities, food, and reasons to go.view project →
Travel
live
Data pipeline
featured projects
Data Pipelines & Client Dashboard Suite
2022–Present
Data infrastructure for a major enterprise marketing client. Ingests from 50+ sources into Snowflake, powering Tableau & Looker dashboards, automated reports, and ad-hoc pulls for 100+ clients. I own it end to end.
50+
sources
100+
clients
$2B+
ad spend
Raw Data Request Builder
2026
Internal tool for Client Services to query Snowflake and export data without going through engineering. Amplify frontend, Dockerized Lambda backend, parameterized queries, exports to CSV/Excel/JSON with automatic S3 staging and Google Drive delivery. Audit log written back to Snowflake on each request.
3
export formats
S3 + Drive
delivery
Zero-code
for end users
Anomaly Detection & Slack Alerting System
2022–Present
Monitors key metrics across Platform and Geo dimensions for Day-over-Day anomalies and sends Slack alerts when values go out of range. Thresholds live in a config table so they can be tuned per metric without a code change.
DoD
anomaly detection
Config-driven
thresholds
Platform × Geo
granularity
data engineering
Snowflake Warehouse Optimization
2023–2024
Years of ungoverned growth left the warehouse with duplicate views, inconsistent naming, and no ownership tracking. Audited 1,000+ tables and views, consolidated redundant logic, standardized naming, and removed unused objects. Compute costs dropped 30%.
30%
cost reduction
1,000+
tables audited
Google Sheets Data Pull Tool
2024
Self-serve reporting tool built inside Google Sheets. Dimension and metric selections trigger Apps Script and Matillion pipelines that generate Snowflake views and write results back to the sheet automatically. Cut turnaround time by 80%.
80%
less manual work
80%
faster turnaround
Custom API Ingestion & Data Processing
2022–2023
Python connectors for 6+ marketing platforms (Google Ads, CM360, Meta, DV360) handling OAuth, pagination, rate limiting, and Snowflake ingestion. Built modularly so adding a new source doesn't mean starting from scratch each time.
6+
platforms
days → hrs
onboarding
ESOV Media Spend Analysis
2022
Analysis for media budget planning using Kantar, Pathmatics, and Google Trends data: ESOV models, CDI/BDI scatter plots, and competitor salience benchmarks built in R. Delivered to brand teams for annual budget strategy.
3
data sources
Competitive
intel framework
Social Listening & Audience Insights
2022
Brandwatch dashboards for social listening across client brands. Python scripts to pull data, generate charts, and draft narratives for 10+ new business pitch decks.
10+
insights decks
Media Campaign Trafficking
2021–2022
Campaign operations across 20+ media vendors: IO setup, budget pacing, pixel trafficking, tag QA, and post-campaign analysis across digital, OOH, and print.
20+
vendors
Multi-channel
digital · OOH · print
academic · penn mcit
COVID & Property Analytics Platform
2025
Data processing platform combining Philadelphia COVID vaccination records, population by ZIP, and property assessments. MVC architecture with custom readers, Singleton logger for audit trails, and memoized processors caching expensive computations.
3
data sources
7+
test suites
Flu Tweet Geolocation Analyzer
2025
Analyzes geotagged tweets for flu-related content using regex pattern matching (handling edge cases like "fluent" vs "#flu"). Maps matches to the nearest U.S. state via Cartesian distance. Polymorphic Reader interface supports JSON and TXT formats.
50
states mapped
2
input formats
Student Management System
2025
Full-featured academic management system with role-based access control. Admins create/delete courses and users. Professors manage enrolled students and grades. Students view schedules, enroll/drop, and check grades. File-based persistence with comprehensive test coverage.
3
user roles
7
test suites
Movie Trivia Database Engine
2025
Movie database system implementing complex query operations without a traditional RDBMS. Supports actor-movie lookups, co-actor discovery, common movie searches, rating filters (critics vs. audience), and mean rating calculations.
8
query types
LC4 Assembler
2026
Mini-assembler in C that reads LC4 assembly text files and emits binary .obj machine code files consumable by PennSim. Implements a full two-pass parser: tokenizes instructions, encodes each opcode and operand into 16-bit binary, and handles .CODE/.DATA/.SYMBOL section headers with proper endianness.
30+
opcodes
2
file formats
Personal Portfolio Website
2025–Present
This site! Single-page portfolio with tabbed navigation, dark/light theme toggle, animated pipeline visualization, data studio aesthetic, and responsive layout. Built from scratch with vanilla HTML, CSS, and JS.
0
frameworks
2
themes
[01]
Lead Data Technology Engineer
- Partner with a major enterprise with $2B+ annual marketing spend to translate reporting needs into scalable data solutions, scoping requirements and delivery plans that align with technical capabilities and business goals.
- Architect scalable data models and ETL pipelines to ingest and transform data from 50+ data sources, powering multiple Looker dashboards that deliver actionable insights for enterprise stakeholders.
- Mentor junior engineers, establish standards & documentation to ensure operational efficiency across projects.
- Lead technical delivery across multiple reporting and automation initiatives, coordinating timelines, priorities, and releases to ensure accurate and timely stakeholder outcomes.
- Champion continuous improvement by identifying gaps in data tooling, proposing roadmap initiatives, and driving cross-functional alignment with Client Services and Data Technology.
50+sources
$2B+ad spend
20dashboards
10+automations
3engineers mentored
[02]
Senior Business Intelligence Engineer
- Maintained high-performance Tableau dashboards and Google Sheets automated reports, managing the full data cycle from ingestion to transformation, visualization, and stakeholder delivery.
- Drove client-facing projects in partnership with Client Services, clarifying requirements, scoping data sources, cleaning & transforming data, and delivering custom dashboards/reports for strategic decision-making.
- Architected automated Google Sheets workflows enabling one-click submission of inputs to trigger Apps Script and Matillion jobs that generate SQL views and write outputs back to designated tabs, reducing manual work and turnaround time by 80%.
- Led large-scale refactoring of 50%+ Snowflake data views, improving query performance and maintainability.
- Oversaw daily monitoring, investigation, and resolution of data issues to ensure accuracy & reliability of BI deliverables.
- Facilitated agile delivery as Scrum Master, driving backlog grooming, brainstorming, and project execution.
80%faster turnaround
50%+views refactored
100+clients
[03]
Business Intelligence Engineer
- Developed backend SQL views for automated marketing reports for 100+ cross-agency clients.
- Designed ETL pipelines via Matillion to ingest data from emails, SFTP, S3 into Snowflake (1TB of data monthly).
- Developed Python scripts to ingest JSON responses from various APIs, employing Pandas for data cleanup.
- Established automated email and Slack alert systems, reducing issue resolution times by 40%.
- Troubleshot 200+ JIRA tickets for Client Services teams on data discrepancies and integration issues.
- Cleaned up Snowflake database with over 1,000 tables/views, achieving a 30% reduction in warehouse costs.
- Created and optimized Tableau dashboards to provide clients with data-driven decision-making capabilities.
- Wrote detailed documentation on Confluence to improve operational clarity and cross-team collaboration.
1TBmonthly
100+clients
30%cost cut
200+tickets
40%faster resolution
[04]
Data Analyst
- Ingested data from multiple platforms and APIs into Domo, maintained dashboards/datasets/ETL dataflow pipelines, created interactive visualizations that transform data into actionable performance insights.
- Handled ad hoc requests from clients and internal teams and troubleshot data collection, taxonomy naming convention & match table issues.
- Built Brandwatch queries/dashboards and created 10+ social listening, audience insights decks for new business pitches.
- Automated the insights writing process and custom charts generation with Python on Jupyter Notebooks (NumPy, Pandas, Matplotlib).
- Wrote SQL queries to set up API connectors (BigQuery) and special calculated fields on Domo.
- Used R to create Excess Share of Voice (ESOV) charts correlating media spend (Kantar, Pathmatics) with Google Trends search indexes.
- Created CDI/BDI scatter plot charts to identify potential markets for budget planning and radar charts to analyse salience scores by brand.
- Performed QA on floodlight tags, tracking URLs and UTM parameters using Google Analytics.
- Provided insights for brand health studies and assisted in the development and measurement of A/B testing.
- Assisted in the development of reach/frequency predictive models across all media channels.
50+dashboards
10+pitch decks
5+API integrations
[05]
Media Coordinator
- Managed relationships with 20+ media vendors (Video, Audio, Display, OOH, Print) per billing actualization, reporting and pacing processes.
- Managed partners' weekly spend and monthly insights reports, kept track of daily spends, KPIs measurement and benchmarks.
- Wrote monthly performance metrics insights and cross-checked with pre-established benchmarks (CTR, VCR, CPM, CPC, Reach/Frequency).
- Managed Google Ads account, built search media plan, adjusted pacings and developed strategies for optimizations (SEO).
- Created new campaigns & placements on Prisma/MediaOcean and trafficking sheets for new placements on Campaign Manager 360 using taxonomy matrix.
- Created screenshot decks, creative rotation trackers and competitive spend analysis using Kantar, MRI Simmons & Commspoint.
- Assisted with Media Mix Model strategy & tactical planning, budget prediction and channel allocations.
- Assisted with researching new audience groups, interests, keywords and creating new ads and adjusting pacing on Facebook & Pinterest Ads.
20+vendors
8+media channels
anna_db>