Data Engineer (M/F/D)
- Paris, France
- Department: Operations
- Skills / Job Strem Ref: Data Engineering
- Type of Contract: Permanent
Dailymotion is the leading video discovery destination & technology that learns about your tastes over time, constantly surfacing the best, most relevant content on the web. Our mission is to provide the best video user experience for consumers on the market, connecting publishers and advertisers to engaged viewers who turn to Dailymotion for their daily fix of the most compelling music, entertainment, news and sports content around.
Through partnerships with the world's leading publishers and content creators, France Télévisions, Le Parisien, CBS, Bein Sports, CNN, GQ, Universal Music Group, VICE and more, Dailymotion commands 3 billion monthly pageviews across its mobile app, desktop and connected TV experiences. Dailymotion is owned by Vivendi, one of the largest mass-media corporations in the world.
At Dailymotion, we‘re storytellers. We build the best place for people to enjoy the videos that matter. We do this through utilizing and developing cutting-edge technology and pushing the envelope to bring discoverable stories to life through premium content from the world’s best publishers. We do this by helping these publishers grow their audiences and monetize their content, their way.
Dailymotion is proud to be an equal employment opportunity and affirmative action employer. We value inclusion and we want you to help us thrive for a more diverse community.
Dailymotion is seeking a Data (Analytics) Engineer for the Analytics Engineering team.
You will join the Data Engineering & Machine Learning craft. A craft consists of multiple teams of engineers and machine learning experts who collaborate daily to create and run Data products in Dailymotion. Inside this craft, the Analytics Engineering team’s mission is to provide trustworthy and available data to enable analysis & insights throughout the company (B2C, B2B products, and business teams).
Analytics Engineering team builds and maintains products like our multi-petabyte data warehouse, event processors (at tens of thousands of messages per second), highly scalable client-facing analytics, data ingestion & distribution, synchronizing data across databases & systems, etc. The team is responsible for making costs-performance tradeoffs around data modeling & architecture. The team is also involved with training users of our data on SQL and analytics best practices and spearheading a significant effort around data governance.
Analytics Engineering is a new and emerging space within the Data sphere. As an Analytics Engineer, you bring a software engineering mindset, best practices to maintain analytics code, and to model data from its source to its use in the data warehouse as business and reporting data. It requires a mix of programming skills and data skills on a day-to-day basis. If you are interested in solving challenging business problems with your skills, consider applying to this role. Your impact will be broad and across all of Dailymotion’s businesses.
What you will do:
Collect vast amounts of raw data from internal sources and external sources in batch and streaming modes.
Expose the data through APIs, flat files, data marts, etc., for internal and external users.
Design Druid datasets for external facing consumers for speed, consistency, cost, and efficiency.
Write complex and optimal SQL queries to transform data in our data lake into reliable business entities and then into reporting aggregates. Identify dependencies for these transformations. Schedule these transformations through Airflow.
Investigate data discrepancy, data quality issues. Debug performance issues using query plan.
Design BigQuery table data model to efficiently answer business use cases considering cost and performance.
Ensure data is clean, consistent, and available. Perform data quality checks, create monitors.
Catalog and document the business entities, data marts, dimensions, metrics, business rules, etc.
Be a knowledge guide on the various business entities, data marts. Train users of our data on SQL and analytics best practices.
Come up with new tools, processes, documents and explore new tech during the cool-down periods.
- BS/MS in Computer Science, Engineering or related field
2+ years experience around Big Data, Data warehousing, writing complex SQL, and debugging complex SQL.
1+ years of experience developing and debugging software in Python.
Good business modeling skills: going from a stakeholder’s expressed requirements to an actual data model.
Ability to work with multiple stakeholders - Product, Engineers, Analysts, Product managers, DevOps, etc.
Comfortable working with Linux and the GCP stack
Experience with PubSub, Data flow, Data Processor, Airflow or Kafka, Spark, or other streaming technologies is a plus.
Experience in real-time analytics databases like Apache Druid is a plus.
Familiarity with NoSQL technologies such as Aerospike is a plus.
Writing and speaking proficiency in English
Technologies used by the team:
Google Cloud Platform (BigQuery, Cloud Storage, Beam/Dataflow, Compute Engine, etc), Python, GO, Airflow, SQL, Git, Java, JSON, Bash, Docker, Druid, Kubernetes, etc
At Dailymotion, we empower candidates to take action. If this job sounds like a great opportunity for you, be confident in your skills, we are always happy to meet you! If needed, we can accommodate our recruitment process for your special abilities.
Contract Type: Full-time permanent role
🏡 Remote Work Policy
💰 Saving Plan Vivendi
🍼 Paternity leave or Coparental leave extended
🕶️ Living Employee Culture (Events / Trainings / Partys / All hands / Dailymotion tradition…)
🚀 Career development support (training / internal mobility / compensation cycle / 360 quarter feedback review …)
🏥 High-end Health Insurance and Personal Services Vouchers (CESU)
⛱️ Paid Time off – RTT and Saving time plan (CET)
✅ Meal Vouchers – Public Transport and Bike refund
🎡European Economic and Social Committee (sport membership/cinemas vouchers/gift vouchers/discount)
🔍Want to learn more about us: