~/anna-vu
SELECT *  FROM  anna_vu.creative_projects  ORDER BY year  DESC -- personal projects · 9 rows returned
WHERE project_name LIKE '% %'
AND category =
#PROJECT CATEGORYSTATUS
01
Film Festival Tracker Track documentary and experimental film festivals worldwide. Deadlines, fees, and acceptance rates, filterable by type and region. view project →
Film
live
02
Analog Film Stock DatabaseCompare analog film stocks by price, availability, and format. Paired with a development lab directory searchable by process and location.view project →
Film
live
03
NYC Film Scene MapInteractive map of NYC screening spaces, workshops, labs, galleries, and community film spots. Filterable by category and borough.view project →
Film
live
04
AuteursPersonal index of filmmakers and performers, searchable by era, nationality, and style. Includes film guides and portraits.view project →
Film
live
05
NYC Food Help MapPublic-service directory of free food assistance organizations across NYC: pantries, soup kitchens, community meals, mobile food, and home delivery. Filterable, mapped, and volunteer-linked.view project →
Civic
live
06
NYC Adventure LogCurated NYC activities across seasons and neighborhoods. Cinema, Food, Arts, Explore, and more. Surprise picker included.view project →
City
live
07
Only Good ThingsCurated self-care activity tracker with filters for season, duration, mood, and setting. Tracks completions locally.view project →
Wellness
live
08
Cơm NhàVietnamese recipes from Miền Bắc, Miền Trung, and Miền Nam. Browse by region, search by ingredient.view project →
Food
live
09
Departure LoungeDestinations as boarding passes. Cities, food, and reasons to go.view project →
Travel
live
SELECT project_name, company, year, stack  FROM  anna_vu.projects  ORDER BY year  DESC -- 15 rows · engineering + academic
Data pipeline
INGEST APIs · SFTP · Platforms · S3 TRANSFORM Python · Matillion · Snowflake SQL VISUALIZE Looker · Tableau · Reporting ANALYZE Performance · Budget · Anomalies
Data Pipelines & Client Dashboard Suite
2022–Present
MatillionSnowflakePythonS3BigQueryLookerTableauDomoSQL
Data infrastructure for a major enterprise marketing client. Ingests from 50+ sources into Snowflake, powering Tableau & Looker dashboards, automated reports, and ad-hoc pulls for 100+ clients. I own it end to end.
50+
sources
100+
clients
$2B+
ad spend
Raw Data Request Builder
2026
AWS AmplifyLambdaDockerECRSnowflakePythonS3Google Drive APIAWS SSM
Internal tool for Client Services to query Snowflake and export data without going through engineering. Amplify frontend, Dockerized Lambda backend, parameterized queries, exports to CSV/Excel/JSON with automatic S3 staging and Google Drive delivery. Audit log written back to Snowflake on each request.
3
export formats
S3 + Drive
delivery
Zero-code
for end users
Anomaly Detection & Slack Alerting System
2022–Present
SQLPythonSnowflakeMatillionSlack API
Monitors key metrics across Platform and Geo dimensions for Day-over-Day anomalies and sends Slack alerts when values go out of range. Thresholds live in a config table so they can be tuned per metric without a code change.
DoD
anomaly detection
Config-driven
thresholds
Platform × Geo
granularity
data engineering
Snowflake Warehouse Optimization
2023–2024
SnowflakeSQL
Years of ungoverned growth left the warehouse with duplicate views, inconsistent naming, and no ownership tracking. Audited 1,000+ tables and views, consolidated redundant logic, standardized naming, and removed unused objects. Compute costs dropped 30%.
30%
cost reduction
1,000+
tables audited
Google Sheets Data Pull Tool
2024
Apps ScriptMatillionSnowflakeGoogle SheetsSQL
Self-serve reporting tool built inside Google Sheets. Dimension and metric selections trigger Apps Script and Matillion pipelines that generate Snowflake views and write results back to the sheet automatically. Cut turnaround time by 80%.
80%
less manual work
80%
faster turnaround
Custom API Ingestion & Data Processing
2022–2023
PythonPandasREST APIsSnowflake
Python connectors for 6+ marketing platforms (Google Ads, CM360, Meta, DV360) handling OAuth, pagination, rate limiting, and Snowflake ingestion. Built modularly so adding a new source doesn't mean starting from scratch each time.
6+
platforms
days → hrs
onboarding
ESOV Media Spend Analysis
2022
RKantarPathmaticsGoogle Trends
Analysis for media budget planning using Kantar, Pathmatics, and Google Trends data: ESOV models, CDI/BDI scatter plots, and competitor salience benchmarks built in R. Delivered to brand teams for annual budget strategy.
3
data sources
Competitive
intel framework
Social Listening & Audience Insights
2022
BrandwatchPythonJupyter
Brandwatch dashboards for social listening across client brands. Python scripts to pull data, generate charts, and draft narratives for 10+ new business pitch decks.
10+
insights decks
Media Campaign Trafficking
2021–2022
Prisma/MediaOceanCampaign Manager 360Google Ads
Campaign operations across 20+ media vendors: IO setup, budget pacing, pixel trafficking, tag QA, and post-campaign analysis across digital, OOH, and print.
20+
vendors
Multi-channel
digital · OOH · print
academic · penn mcit
COVID & Property Analytics Platform
2025
JavaCSV/JSONMVCMemoizationJUnit
Data processing platform combining Philadelphia COVID vaccination records, population by ZIP, and property assessments. MVC architecture with custom readers, Singleton logger for audit trails, and memoized processors caching expensive computations.
3
data sources
7+
test suites
Flu Tweet Geolocation Analyzer
2025
JavaRegexGeolocationSingletonFactory
Analyzes geotagged tweets for flu-related content using regex pattern matching (handling edge cases like "fluent" vs "#flu"). Maps matches to the nearest U.S. state via Cartesian distance. Polymorphic Reader interface supports JSON and TXT formats.
50
states mapped
2
input formats
Student Management System
2025
JavaOOPHashMapAuthJUnit 5
Full-featured academic management system with role-based access control. Admins create/delete courses and users. Professors manage enrolled students and grades. Students view schedules, enroll/drop, and check grades. File-based persistence with comprehensive test coverage.
3
user roles
7
test suites
Movie Trivia Database Engine
2025
JavaArrayListFile I/OJUnit
Movie database system implementing complex query operations without a traditional RDBMS. Supports actor-movie lookups, co-actor discovery, common movie searches, rating filters (critics vs. audience), and mean rating calculations.
8
query types
LC4 Assembler
2026
CBinary I/OISAFile ParsingMakefile
Mini-assembler in C that reads LC4 assembly text files and emits binary .obj machine code files consumable by PennSim. Implements a full two-pass parser: tokenizes instructions, encodes each opcode and operand into 16-bit binary, and handles .CODE/.DATA/.SYMBOL section headers with proper endianness.
30+
opcodes
2
file formats
Personal Portfolio Website
2025–Present
HTMLCSSJavaScriptSVG
This site! Single-page portfolio with tabbed navigation, dark/light theme toggle, animated pipeline visualization, data studio aesthetic, and responsive layout. Built from scratch with vanilla HTML, CSS, and JS.
0
frameworks
2
themes
SELECT role, company, period, wins  FROM  anna_vu.experience  ORDER BY start_date  DESC -- 5 records
[01]
Lead Data Technology Engineer
DEPT Agency · New York, NY · Aug 2025 – Present
MatillionSnowflakePythonLookerSQLS3
  • Partner with a major enterprise with $2B+ annual marketing spend to translate reporting needs into scalable data solutions, scoping requirements and delivery plans that align with technical capabilities and business goals.
  • Architect scalable data models and ETL pipelines to ingest and transform data from 50+ data sources, powering multiple Looker dashboards that deliver actionable insights for enterprise stakeholders.
  • Mentor junior engineers, establish standards & documentation to ensure operational efficiency across projects.
  • Lead technical delivery across multiple reporting and automation initiatives, coordinating timelines, priorities, and releases to ensure accurate and timely stakeholder outcomes.
  • Champion continuous improvement by identifying gaps in data tooling, proposing roadmap initiatives, and driving cross-functional alignment with Client Services and Data Technology.
50+sources
$2B+ad spend
20dashboards
10+automations
3engineers mentored
[02]
Senior Business Intelligence Engineer
DEPT Agency · New York, NY · Jun 2024 – Aug 2025
TableauApps ScriptMatillionSnowflakeSQLPython
  • Maintained high-performance Tableau dashboards and Google Sheets automated reports, managing the full data cycle from ingestion to transformation, visualization, and stakeholder delivery.
  • Drove client-facing projects in partnership with Client Services, clarifying requirements, scoping data sources, cleaning & transforming data, and delivering custom dashboards/reports for strategic decision-making.
  • Architected automated Google Sheets workflows enabling one-click submission of inputs to trigger Apps Script and Matillion jobs that generate SQL views and write outputs back to designated tabs, reducing manual work and turnaround time by 80%.
  • Led large-scale refactoring of 50%+ Snowflake data views, improving query performance and maintainability.
  • Oversaw daily monitoring, investigation, and resolution of data issues to ensure accuracy & reliability of BI deliverables.
  • Facilitated agile delivery as Scrum Master, driving backlog grooming, brainstorming, and project execution.
80%faster turnaround
50%+views refactored
100+clients
[03]
Business Intelligence Engineer
DEPT Agency · New York, NY · Oct 2022 – May 2024
SQLMatillionPythonPandasSnowflakeTableauREST APIsSlack API
  • Developed backend SQL views for automated marketing reports for 100+ cross-agency clients.
  • Designed ETL pipelines via Matillion to ingest data from emails, SFTP, S3 into Snowflake (1TB of data monthly).
  • Developed Python scripts to ingest JSON responses from various APIs, employing Pandas for data cleanup.
  • Established automated email and Slack alert systems, reducing issue resolution times by 40%.
  • Troubleshot 200+ JIRA tickets for Client Services teams on data discrepancies and integration issues.
  • Cleaned up Snowflake database with over 1,000 tables/views, achieving a 30% reduction in warehouse costs.
  • Created and optimized Tableau dashboards to provide clients with data-driven decision-making capabilities.
  • Wrote detailed documentation on Confluence to improve operational clarity and cross-team collaboration.
1TBmonthly
100+clients
30%cost cut
200+tickets
40%faster resolution
[04]
Data Analyst
FIG · New York, NY · Jul 2022 – Oct 2022
DomoPythonSQLBigQueryBrandwatchJupyterR
  • Ingested data from multiple platforms and APIs into Domo, maintained dashboards/datasets/ETL dataflow pipelines, created interactive visualizations that transform data into actionable performance insights.
  • Handled ad hoc requests from clients and internal teams and troubleshot data collection, taxonomy naming convention & match table issues.
  • Built Brandwatch queries/dashboards and created 10+ social listening, audience insights decks for new business pitches.
  • Automated the insights writing process and custom charts generation with Python on Jupyter Notebooks (NumPy, Pandas, Matplotlib).
  • Wrote SQL queries to set up API connectors (BigQuery) and special calculated fields on Domo.
  • Used R to create Excess Share of Voice (ESOV) charts correlating media spend (Kantar, Pathmatics) with Google Trends search indexes.
  • Created CDI/BDI scatter plot charts to identify potential markets for budget planning and radar charts to analyse salience scores by brand.
  • Performed QA on floodlight tags, tracking URLs and UTM parameters using Google Analytics.
  • Provided insights for brand health studies and assisted in the development and measurement of A/B testing.
  • Assisted in the development of reach/frequency predictive models across all media channels.
50+dashboards
10+pitch decks
5+API integrations
[05]
Media Coordinator
FIG · New York, NY · Jun 2021 – Jun 2022
Prisma/MediaOceanCampaign Manager 360Google AdsKantar
  • Managed relationships with 20+ media vendors (Video, Audio, Display, OOH, Print) per billing actualization, reporting and pacing processes.
  • Managed partners' weekly spend and monthly insights reports, kept track of daily spends, KPIs measurement and benchmarks.
  • Wrote monthly performance metrics insights and cross-checked with pre-established benchmarks (CTR, VCR, CPM, CPC, Reach/Frequency).
  • Managed Google Ads account, built search media plan, adjusted pacings and developed strategies for optimizations (SEO).
  • Created new campaigns & placements on Prisma/MediaOcean and trafficking sheets for new placements on Campaign Manager 360 using taxonomy matrix.
  • Created screenshot decks, creative rotation trackers and competitive spend analysis using Kantar, MRI Simmons & Commspoint.
  • Assisted with Media Mix Model strategy & tactical planning, budget prediction and channel allocations.
  • Assisted with researching new audience groups, interests, keywords and creating new ads and adjusting pacing on Facebook & Pinterest Ads.
20+vendors
8+media channels
CONNECT TO  anna_vu.profile  -- SQL & Python · type a query below
anna_db> 
anna_vu :: snowflake UTF-8 studio 9 rows | 2.1ms
PUMKIN
z z Z