Case Study: Enterprise data lake on cloud

Industry

Media

Offering

Cloud Advisory Services

Workload

DC, Servers & Data

Cloud

AWS

Project scope

— Data lake architecture design
— Data transformation and storage in data lake
— Customized reports in PowerBI

About the client

The client is a leading media production and broadcasting company, subsidiary of a global media conglomerate. They have over 30 television channels, a digital business and a movie production business, reaching over 700 million viewers in India.

Business challenge

As part of their digital strategy, our client wanted to optimise user experience across channels — iOS and Android apps, Fire TV, web, and so on — based on user behaviour and preferences. This required a deeper understanding of customer behavioural patterns across platforms.

Presently, they were using Segment as the tool to collect around 6.5 billion records (20TB of raw data) of behavioural data from their 30 million online viewers every month from across sources.

In order to deliver a user-focussed digital viewing experience, the client needed

Reliable storage, with protection against data corruption and other types of data losses
Security against un-authorized data access
Ease of finding a single record in billions (by efficiently indexing data)
An advanced analytics engine that can help them derive and visualise meaningful insights from the client’s high volume and variety of data.
All of this forming their single source of truth.

Solution

We, at 1CloudHub, enabled an enterprise data lake for all of the client’s data to reside in one place — preserving accuracy and timeliness of the data.

Leveraging our client’s existing mechanism to collect and feed data into the data lake, we created a pipeline with EMR (Elastic MapReduce) for data crunching or ETL (Extract, Transform, Load) and Power BI for self-service visualisation.

Our approach

Understand

Define

Design

Transform

Completion and reporting

01. Understand

In collaboration with the client’s development team, we outlined the volume, velocity, veracity and variety of data.

02. Define

We worked with the client’s business teams and domain experts to define reports in Power BI for the 18 use cases the client had identified.

03. Design

We mapped data to corresponding reports and planned data transformation.
Based on these, we designed and architected the data lake and pipeline necessary for Power BI.
With the client’s sign-off, we deployed the solution on AWS cloud.

04. Transform

Once the infrastructure was in place, our data engineering team performed the necessary ETL steps such as cleaning and consolidation to derive value from the raw data.
We stored this in an S3 bucket as parquet formatted files.
We imported transformed data as data-marts into AWS Redshift, to be used for Power BI reports.

05. Completion and reporting

We delivered a summary of findings and recommendations for production deployment to bring the PoC to a meaningful closure.

Outcomes

Better

We enabled advanced analytics for data from up to a year — compared to the 3 months data as per agreement — to deliver the meaningful insights the business teams sought.

Faster

We crunched over 12 million records in under an hour, running more than 100 VMs concurrently in a cluster.

Cheaper

We delivered each report at a cost of $70. At this cost, we delivered an excellent price-to-performance ratio, driven by the spot fleet instances we used and our on-demand or pay-as-you-use cloud model.

A similar setup on-premise in a data centre would have cost the client 12,000 times more.

Looking forward

We are delighted to have helped the client create a centralized, analytics-ready repository for their Big Data and look forward to helping them meet their strategic goals using our cloud capabilities.

Latest case studies

Elevating Financial Operations & Security with AWS Migration

Industry Financial Solution CloudAWSPublished OnDec, 2024 About the client A leading financial services firm, aims to migrate its on-premises infrastructure to AWS to enhance security and align with financial industry compliance standards. The project involves assessing the current infrastructure, deploying AWS landing zone, designing a secure AWS architecture with VPCs, encryption, IAM policies, and threat[...]

Industry

Media

Offering

Cloud Advisory Services

Workload

DC, Servers & Data

Cloud

AWS

Project scope

About the client

Business challenge

Solution

Our approach

Understand

Define

Design

Transform

Completion and reporting

01. Understand

02. Define

03. Design

04. Transform

05. Completion and reporting

Outcomes

Better

Faster

Cheaper

Looking forward

Latest case studies

Elevating Financial Operations & Security with AWS Migration​

Migration of SAAS Smart Carbon Measurement and Management Platform from Azure to AWS

Seamless Migration of Core Banking Applications (LOS, LCS, LMS) from On-Premises to AWS Cloud

Streamlining Application and Database Modernization with Seamless CI/CD Integration

Transforming Mental Health Support Through AI-Driven Transcription and Performance Analytics

Elevating Frontline Operations: Leveraging AI for Efficiency and Customer Delight

Revolutionizing Email Communication: Intelligent Email Generation with Generative AI

Implementing Automated Document Comparison Systems in Financial Services

Revolutionizing Customer Support with AI: Intelligent Chatbot Implementation

Modernization of Document Management System

Containerization and Migration of EC2 Applications to Amazon EKS

Elevating Financial Operations with AWS Migration

Banking Infrastructure Migration to AWS: Achieving Scalability, Security, and Efficiency with 1CloudHub

Enhancing E-Commerce Platforms with Image Processing Capabilities

Migrate and Modernize the MSSQL on RDS to PostgreSQL on Aurora (Babelfish) Using DMS

Migrate from DB on Amazon EC2 to Amazon RDS for MySQL with HA using AWS DMS for Nalli E-Commerce Platform

Migrate from MySQL on Amazon EC2 to Amazon RDS for MySQL

DB Modernization MS SQL on RDS > PostgreSQL on Aurora

AWS Well-Architected Review For A Retail NBFC

Cost effective, Scalable Cloud Solution for connected vehicle telematics

Migrating LOB workloads for NBFC

E-Commerce Platform Migration to AWS Cloud

Application Modernization and Migration of Enterprise Workloads

Financial Services – Application Modernization to enhance agility & scalability

Winning the DevOps Way

Application and Database Modernization along with CI/CD

Using DevOps To Keep An Edge

Migration of large Windows landscape from On-prem Data center to AWS

Migration of e-Commerce portal from On-prem Data Center to AWS

DevOps Implementation- Automated Deployment Process

Data Lake on AWS For Seats and Revenue Analytics

IaC Automation & Data Generation

Migration of IT Infrastructure from OnPrem to Google Cloud

Leveraging AI/ML(Personalization) to Increase Checkout Ratio & Rationalize Discount Coupons for a leading B2C E-Ticketing Platform

App Cloud Maturity Enhancement (Using Containers)​

Pro-Active 24×7 Managed Services

CI/CD Application Deployment Process using Serverless Technology

Knowledge Portal on AWS for a Leading Corporate Compliance in India​

Migration of E-commerce Portal from OnPrem to AWS Cloud

Migration of DNB platform for SMEs from an existing hyper cloud platform to Azure

Personalize fitment to determine AI/ML driven Solution Roadmap for a leading ​ B2C E-Ticketing Platform​

SIFT Customer Engagement Platform on Cloud

Digital Asset Management Platform​

DataLake and Analytics for Digital Exam Platform on AWS

SAP S/4 HANA Functional Enhancement & Implementation

Case Study : Migration of popular news sites to Cloud with Zero Downtime

Case Study : Hospital Information System (HIS) set-up on Cloud

Case Study: SAP ECC Migration on Azure Cloud for a Health Care Manufacturer

Case Study : SAP S/4 HANA Greenfield Infra Implementation

Case Study: Big Data on Cloud

Case Study: DR for geographically diverse SAP

Elevating Financial Operations & Security with AWS Migration

App Cloud Maturity Enhancement (Using Containers)

Knowledge Portal on AWS for a Leading Corporate Compliance in India

Personalize fitment to determine AI/ML driven Solution Roadmap for a leading B2C E-Ticketing Platform

Digital Asset Management Platform