# · PROJECT · ENTRIES · TECH · STATUS
01
Film Festival Tracker
Documentary and experimental film festivals worldwide. Deadlines, fees, acceptance rates — filterable by type and region to plan smarter submissions.
view project →
89 festivals · 30 countries
HTML · JSON · Vanilla JS
live
02
Analog Film Stock Database
Film stocks with price tracking and availability, plus development labs by location and pricing.
view project →
54 stocks · 25 labs
HTML · JSON · Vanilla JS
live
03
NYC Film Scene Map
New York's experimental film world on one map: screening spaces, workshops, labs, galleries, community spots.
view project →
100 venues · 5 categories
HTML · JSON · Leaflet.js
live
04
Auteurs
A personal index of 100 filmmakers I admire, searchable and filterable by style tags.
view project →
100 filmmakers · 4 eras
HTML · JSON
live
05
Only Good Things
Self-care activities filterable by season, duration, and setting, with completion tracking.
view project →
143 activities · 8 categories
HTML · JSON · Vanilla JS
live
06
Cơm Nhà
Vietnamese recipes from Miền Bắc, Miền Trung, and Miền Nam. Browse by region, search by ingredient.
view project →
175 recipes · 3 regions
HTML · JSON · Vanilla JS
live
07
Departure Lounge
Destinations as boarding passes: cities, food, and reasons to go.
view project →
50 countries · 5 regions
HTML · JSON · Vanilla JS
live
Data pipeline
/* data systems, pipelines, and tools — built professionally and academically to solve real problems. */
professional
Enterprise ETL Pipeline Architecture
DEPT Agency · 2022–Present
Architected the data warehouse and ETL pipelines to ingest 30+ sources into Snowflake, powering Looker dashboards for campaign performance and budget allocation.
30+
sources
1TB
monthly
$2B+
ad spend
Google Sheets Data Pull Tool
DEPT Agency · 2024
Self-serve data pull tool for Client Services. Dimension and metric selections in Google Sheets trigger Apps Script and Matillion pipelines that generate Snowflake views and write results back to the sheet.
80%
less manual work
80%
faster turnaround
Real-Time Slack Alerting Pipeline
DEPT Agency · 2023
Evaluates marketing data against thresholds and pushes formatted alerts to client Slack channels when anomalies are detected.
40%
faster resolution
Real-time
alerting
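The threshold check behind this kind of alerting can be sketched roughly as follows. This is a minimal illustration, not the production pipeline: the `Threshold` class, the metric names, and the message format are all hypothetical, and the only Slack detail assumed is that an incoming webhook accepts a JSON body with a `text` field.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass
class Threshold:
    """Allowed range for one metric (hypothetical structure)."""
    metric: str
    min_value: Optional[float] = None
    max_value: Optional[float] = None


def find_breaches(row: dict, thresholds: list[Threshold]) -> list[str]:
    """Return one message per metric in `row` that falls outside its range."""
    messages = []
    for t in thresholds:
        value = row.get(t.metric)
        if value is None:
            continue  # metric not reported for this row
        if t.min_value is not None and value < t.min_value:
            messages.append(f"{t.metric} = {value} is below the minimum {t.min_value}")
        if t.max_value is not None and value > t.max_value:
            messages.append(f"{t.metric} = {value} is above the maximum {t.max_value}")
    return messages


def build_slack_payload(client: str, breaches: list[str]) -> dict:
    """Format breaches as a JSON-serializable body for a Slack incoming webhook."""
    lines = "\n".join(f"• {b}" for b in breaches)
    return {"text": f":warning: *{client}*: {len(breaches)} metric(s) out of range\n{lines}"}
```

Keeping detection and formatting separate means the same checks could feed email or Slack without change; posting the payload to a webhook URL is left out of the sketch.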
Snowflake Warehouse Optimization
DEPT Agency · 2023–2024
Refactored 50%+ of data views, standardized naming conventions, consolidated duplicate logic, and documented ownership across the warehouse.
30%
cost reduction
1,000+
tables audited
Custom API Ingestion & Data Processing
DEPT Agency · 2022–2023
Reusable Python connectors for marketing APIs handling auth, pagination, rate limiting, and Pandas transformation before loading into Snowflake.
6+
APIs integrated
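A reusable connector of this kind usually centers on a pagination loop with rate-limit handling. The sketch below is an assumption-heavy illustration rather than the actual connectors: `fetch_page` is a hypothetical callable that wraps the authenticated HTTP request, and the `records` / `next_cursor` response keys and the HTTP 429 convention are assumed, since each marketing API differs.

```python
import time
from typing import Callable, Iterator, Optional, Tuple


def fetch_all_pages(
    fetch_page: Callable[[Optional[str]], Tuple[int, dict]],
    max_retries: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[dict]:
    """Yield every record from a cursor-paginated API.

    `fetch_page(cursor)` is assumed to return (status_code, json_body).
    A 429 status triggers exponential backoff before the same page is retried.
    """
    cursor = None
    while True:
        for attempt in range(max_retries + 1):
            status, body = fetch_page(cursor)
            if status != 429:
                break
            if attempt == max_retries:
                raise RuntimeError("rate limit retries exhausted")
            sleep(base_delay * 2 ** attempt)  # 1s, 2s, 4s, ...
        if status != 200:
            raise RuntimeError(f"unexpected status {status}")
        yield from body["records"]
        cursor = body.get("next_cursor")
        if cursor is None:
            return  # last page reached
```

The yielded records could then be passed to `pandas.DataFrame(list(...))` for cleanup before loading. Injecting `fetch_page` and `sleep` keeps the loop testable without network access.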
Client Reporting & Dashboard Suite
DEPT Agency · 2022–2024
SQL views powering automated reports and Tableau dashboards for 100+ clients. Full lifecycle from requirements through development and maintenance.
100+
clients
200+
tickets resolved
ESOV Media Spend Analysis
FIG Agency · 2022
ESOV charts correlating media spend with Google Trends data. CDI/BDI scatter plots for market identification and radar charts for brand salience.
3
chart types
Social Listening & Audience Insights
FIG Agency · 2022
Brandwatch social listening dashboards across client brands. Automated insights writing and chart generation with Python for 10+ pitch decks.
10+
insights decks
Media Campaign Trafficking
FIG Agency · 2021–2022
Campaign setup on Prisma/MediaOcean, trafficking sheets for CM360, Google Ads management, and optimization strategies across 20+ vendors.
20+
vendors
academic
COVID & Property Analytics Platform
UPenn, CIT 5940 (Group) · 2025
Data processing platform combining Philadelphia COVID vaccination records, population by ZIP, and property assessments. MVC architecture with custom readers, Singleton logger for audit trails, and memoized processors caching expensive computations.
3
data sources
7+
test suites
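The memoized-processor idea can be illustrated with a small sketch. It is written in Python for brevity and does not reproduce the course project's code; the class name, the tuple layout of the records, and the per-capita metric are assumptions chosen for the example.

```python
class VaccinationProcessor:
    """Computes vaccinations per capita by ZIP code and caches results by date."""

    def __init__(self, vaccinations, populations):
        # vaccinations: list of (zip_code, date, count) tuples (assumed layout)
        # populations: dict mapping zip_code -> population
        self._vaccinations = vaccinations
        self._populations = populations
        self._cache = {}
        self.computations = 0  # counts real computations, to show the cache working

    def per_capita(self, date):
        """Return {zip_code: vaccinations / population} for one date."""
        if date in self._cache:
            return self._cache[date]  # memoized: skip the scan entirely
        self.computations += 1
        result = {}
        for zip_code, record_date, count in self._vaccinations:
            population = self._populations.get(zip_code, 0)
            # Skip ZIP codes with unknown or zero population, and empty counts.
            if record_date == date and population > 0 and count > 0:
                result[zip_code] = round(count / population, 4)
        self._cache[date] = result
        return result
```

Repeated queries for the same date return the cached dictionary, so the full scan over the records runs only once per distinct input.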
Flu Tweet Geolocation Analyzer
UPenn, CIT 5940 (Solo) · 2025
Analyzes geotagged tweets for flu-related content using regex pattern matching (handling edge cases like "fluent" vs "#flu"). Maps matches to the nearest U.S. state via Cartesian distance. Polymorphic Reader interface supports JSON and TXT formats.
50
states mapped
2
input formats
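The two core steps, keyword matching and nearest-state lookup, might look something like the sketch below. The regex is one plausible way to accept "flu" and "#flu" while rejecting "fluent"; it is not necessarily the pattern used in the project. The state reference points are rough illustrative coordinates, not the assignment's data.

```python
import math
import re

# "flu" with an optional leading "#", not embedded in a longer word.
FLU_PATTERN = re.compile(r"(?<![A-Za-z])#?flu(?![A-Za-z])", re.IGNORECASE)

# Approximate (latitude, longitude) reference points, for illustration only.
STATE_CENTERS = {
    "PA": (40.9, -77.8),
    "NY": (42.9, -75.5),
    "CA": (37.2, -119.4),
}


def mentions_flu(text: str) -> bool:
    """True if the text contains 'flu' or '#flu' as a standalone word."""
    return FLU_PATTERN.search(text) is not None


def nearest_state(lat: float, lon: float, centers: dict) -> str:
    """Return the state whose reference point is closest in plain Cartesian distance."""
    return min(
        centers,
        key=lambda name: math.hypot(centers[name][0] - lat, centers[name][1] - lon),
    )
```

For example, `mentions_flu("She is fluent in French")` is rejected because "flu" is followed by a letter, while a tweet tagged near Philadelphia at `(39.95, -75.16)` resolves to `"PA"` with these reference points.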
Student Management System
UPenn, CIT 5910 (Group) · 2025
Full-featured academic management system with role-based access control. Admins create/delete courses and users. Professors manage enrolled students and grades. Students view schedules, enroll/drop, and check grades. File-based persistence with comprehensive test coverage.
3
user roles
7
test suites
Movie Trivia Database Engine
UPenn, CIT 5910 · 2025
Movie database system implementing complex query operations without a traditional RDBMS. Supports actor-movie lookups, co-actor discovery, common movie searches, rating filters (critics vs. audience), and mean rating calculations.
8
query types
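Several of these queries reduce to set operations over an in-memory mapping. The following sketch shows the general approach under assumed data structures; the movie titles, actor names, and ratings are placeholders, and the function names are hypothetical rather than taken from the project.

```python
# Placeholder data: movie -> cast, and movie -> (critics, audience) ratings.
CAST = {
    "Movie A": {"actor one", "actor two"},
    "Movie B": {"actor one", "actor three"},
    "Movie C": {"actor two", "actor three"},
}
RATINGS = {"Movie A": (90, 70), "Movie B": (60, 80), "Movie C": (75, 75)}


def movies_of(actor):
    """All movies in which the actor appears."""
    return {movie for movie, cast in CAST.items() if actor in cast}


def co_actors(actor):
    """Everyone who shares at least one movie with the actor."""
    found = set()
    for movie in movies_of(actor):
        found |= CAST[movie]
    found.discard(actor)
    return found


def common_movies(actor_a, actor_b):
    """Movies featuring both actors."""
    return movies_of(actor_a) & movies_of(actor_b)


def mean_rating(movies, audience=False):
    """Mean critics (default) or audience rating over the given movies."""
    index = 1 if audience else 0
    scores = [RATINGS[m][index] for m in movies]
    return sum(scores) / len(scores) if scores else None
```

Set intersection and union stand in for the joins an RDBMS would perform, which keeps each query a few lines long.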
LC4 Assembler
UPenn, CIT 5930 (Solo) · 2026
Mini-assembler in C that reads LC4 assembly text files and emits binary .obj machine code files consumable by PennSim. Implements a full two-pass parser: tokenizes instructions, encodes each opcode and operand into 16-bit binary, and handles .CODE/.DATA/.SYMBOL section headers with proper endianness.
30+
opcodes
2
file formats
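The encoding and output steps can be sketched in a few lines. The project itself is written in C; this Python version is only an illustration. It assumes the LC4 arithmetic layout `0001 ddd sss xxx ttt` and a code section header of `0xCADE`, start address, and word count, with every 16-bit word stored big-endian. Those details should be checked against the course's ISA and PennSim documentation.

```python
import struct

# Assumed sub-opcodes for the LC4 arithmetic group (opcode 0001).
ARITH_SUBOPCODES = {"ADD": 0b000, "MUL": 0b001, "SUB": 0b010, "DIV": 0b011}


def encode_arith(mnemonic: str, rd: int, rs: int, rt: int) -> int:
    """Pack a register-register arithmetic instruction into one 16-bit word."""
    for reg in (rd, rs, rt):
        if not 0 <= reg <= 7:
            raise ValueError(f"register out of range: R{reg}")
    return (
        (0b0001 << 12)
        | (rd << 9)
        | (rs << 6)
        | (ARITH_SUBOPCODES[mnemonic] << 3)
        | rt
    )


def code_section(address: int, words: list) -> bytes:
    """Serialize a code section: header, start address, word count, then the words."""
    header = struct.pack(">HHH", 0xCADE, address, len(words))  # ">" = big-endian
    return header + b"".join(struct.pack(">H", w) for w in words)
```

Under these assumptions, `ADD R1, R2, R3` encodes to `0x1283`, and `code_section(0, [0x1283])` produces the bytes `CA DE 00 00 00 01 12 83`.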
Personal Portfolio Website
Personal · 2025–Present
This site! Single-page portfolio with tabbed navigation, dark/light theme toggle, animated pipeline visualization, data studio aesthetic, and responsive layout. Built from scratch with vanilla HTML, CSS, and JS.
0
frameworks
2
themes
[01]
Lead Data Technology Engineer
- Partner with a major enterprise with $2B+ annual marketing spend to translate reporting needs into scalable data solutions, scoping requirements and delivery plans that align with technical capabilities and business goals.
- Architect scalable data models and ETL pipelines to ingest and transform data from 30+ data sources, powering multiple Looker dashboards that deliver actionable insights for enterprise stakeholders.
- Mentor junior engineers, establish standards & documentation to ensure operational efficiency across projects.
- Lead technical delivery across multiple reporting and automation initiatives, coordinating timelines, priorities, and releases to ensure accurate and timely stakeholder outcomes.
- Champion continuous improvement by identifying gaps in data tooling, proposing roadmap initiatives, and driving cross-functional alignment with Client Services and Data Technology.
30+ sources
$2B+ ad spend
20 dashboards
10+ automations
5 engineers mentored
[02]
Senior Business Intelligence Engineer
- Maintained high-performance Tableau dashboards and Google Sheets automated reports, managing the full data cycle from ingestion to transformation, visualization, and stakeholder delivery.
- Drove client-facing projects in partnership with Client Services, clarifying requirements, scoping data sources, cleaning & transforming data, and delivering custom dashboards/reports for strategic decision-making.
- Architected automated Google Sheets workflows enabling one-click submission of inputs to trigger Apps Script and Matillion jobs that generate SQL views and write outputs back to designated tabs, reducing manual work and turnaround time by 40%.
- Led large-scale refactoring of 50%+ Snowflake data views, improving query performance and maintainability.
- Oversaw daily monitoring, investigation, and resolution of data issues to ensure accuracy & reliability of BI deliverables.
- Facilitated agile delivery as Scrum Master, driving backlog grooming, brainstorming, and project execution.
40% faster turnaround
50%+ views refactored
100+ clients
[03]
Business Intelligence Engineer
- Developed backend SQL views for automated marketing reports for 100+ cross-agency clients.
- Designed ETL pipelines via Matillion to ingest data from emails, SFTP, S3 into Snowflake (1TB of data monthly).
- Developed Python scripts to ingest JSON responses from various APIs, employing Pandas for data cleanup.
- Established automated email and Slack alert systems, reducing issue resolution times by 40%.
- Troubleshot 200+ JIRA tickets for Client Services teams on data discrepancies and integration issues.
- Cleaned up Snowflake database with over 1,000 tables/views, achieving a 30% reduction in warehouse costs.
- Created and optimized Tableau dashboards to provide clients with data-driven decision-making capabilities.
- Wrote detailed documentation on Confluence to improve operational clarity and cross-team collaboration.
1TB monthly
100+ clients
30% cost cut
200+ tickets
40% faster resolution
[04]
Data Analyst
- Ingested data from multiple platforms and APIs into Domo, maintained dashboards, datasets, and ETL dataflow pipelines, and created interactive visualizations that turn data into actionable performance insights.
- Handled ad hoc requests from clients and internal teams and troubleshot data collection, taxonomy naming convention & match table issues.
- Built Brandwatch queries and dashboards, and created 10+ social listening and audience insights decks for new business pitches.
- Automated insights writing and custom chart generation with Python in Jupyter Notebooks (NumPy, Pandas, Matplotlib).
- Wrote SQL queries to set up API connectors (BigQuery) and special calculated fields on Domo.
- Used R to create Excess Share of Voice (ESOV) charts correlating media spend (Kantar, Pathmatics) with Google Trends search indexes.
- Created CDI/BDI scatter plot charts to identify potential markets for budget planning and radar charts to analyse salience scores by brand.
- Performed QA on floodlight tags, tracking URLs and UTM parameters using Google Analytics.
- Provided insights for brand health studies and assisted in the development and measurement of A/B testing.
- Assisted in the development of reach/frequency predictive models across all media channels.
50+ dashboards
10+ pitch decks
5+ API integrations
[05]
Media Coordinator
- Managed relationships with 20+ media vendors (Video, Audio, Display, OOH, Print) across billing actualization, reporting, and pacing processes.
- Managed partners' weekly spend and monthly insights reports, and tracked daily spend, KPI measurement, and benchmarks.
- Wrote monthly performance metrics insights and cross-checked with pre-established benchmarks (CTR, VCR, CPM, CPC, Reach/Frequency).
- Managed the Google Ads account, built the search media plan, adjusted pacing, and developed optimization strategies (SEO).
- Created new campaigns & placements on Prisma/MediaOcean and trafficking sheets for new placements on Campaign Manager 360 using taxonomy matrix.
- Created screenshot decks, creative rotation trackers and competitive spend analysis using Kantar, MRI Simmons & Commspoint.
- Assisted with Media Mix Model strategy & tactical planning, budget prediction and channel allocations.
- Assisted with researching new audience groups, interests, and keywords, creating new ads, and adjusting pacing on Facebook & Pinterest Ads.
20+ vendors
8+ media channels