Spectrum

Data Engineer

Posted on: 27 Sep 2022

Stamford, CT

Job Description

JOB SUMMARY

The Data Engineer is responsible for maintaining scalable, reliable, consistent, and repeatable systems that support data operations for Reporting, Analytics, Applications, and Data Science by gathering and processing raw data at scale. The role profiles data to measure quality, integrity, accuracy, and completeness; delivers solutions by developing, testing, and implementing code and scripts; and develops data set processes for data modeling, mining, and consumption.

MAJOR DUTIES AND RESPONSIBILITIES

Actively and consistently supports all efforts to simplify and enhance the customer experience.

Create and maintain scalable, reliable, consistent and repeatable systems that support data operations for Reporting, Analytics, Applications, and Data Science.

Gather and process raw data at scale (including writing scripts, web scraping, calling APIs, writing SQL queries, etc.).

Use ETL processes in order to maintain, improve, clean, and manipulate data.

Profile data to measure quality, integrity, accuracy, and completeness.

Develop and implement tools, scripts, queries, and applications for ETL/ELT and data operations.

Design, build, and automate machine learning data pipelines.

Deliver solutions by developing, testing, and implementing code and scripts.

Produce reports and uphold data delivery schedules.

Manage life cycle of multiple data sources.

Work closely with stakeholders on the data demand side (analysts and data scientists).

Increase speed to delivery by implementing workload/workflow automation solutions.

Perform other duties as assigned.

REQUIRED QUALIFICATIONS

Skills/Abilities and Knowledge

Ability to read, write, speak and understand English
Ability to use a wide variety of open source technologies and cloud services
Extensive coding/scripting experience using Python, R, shell scripts
Extensive experience with SQL, Tableau, ML Pipeline techniques, and ETL techniques
Extensive background in Linux/Unix/CentOS installation and administration; Windows experience preferred
Extensive knowledge of data storage, including when to use a file system, relational database, or NoSQL variant
Extensive experience with Spark and Hadoop/Hive
Extensive familiarity with JavaScript API, REST API, or Data Extract APIs
Extensive experience receiving, converting, and cleansing big data
Extensive experience with visualization or BI tools, such as Tableau
Extensive experience with data virtualization concepts and software (Denodo, Teiid, JBoss)
Extensive experience with data workflow/data prep platforms, such as Informatica, Pentaho, or Talend
Ability to identify and resolve end-to-end performance, network, server, and platform issues
Strong attention to detail, with the ability to prioritize and execute multiple tasks effectively

Education

Bachelor's degree in an engineering discipline or computer science

Related Work Experience

Hands-on working experience with RDBMS, SQL, scripting, and coding - 5-7 years
Linux/Unix/CentOS system admin - 3+ years

PREFERRED QUALIFICATIONS

Skills/Abilities and Knowledge

Extensive experience with JavaScript API, REST API, or Data Extract APIs
Extensive experience with data workflow/data prep platforms, such as Informatica, Pentaho, or Talend
Expert knowledge of best practices and IT operations in an always-up, always-available service
Extensive experience with data virtualization concepts and software (Denodo, Teiid, JBoss)
Extensive experience receiving, converting, and cleansing big data
Extensive experience with visualization or BI tools, such as Tableau
Ability to create proof of concept experiments for analytics, machine learning, or visualization tools that include hypothesis, test plans, and outcome analysis

Related Work Experience

Leadership experience in advanced operational analytics

WORKING CONDITIONS

Office environment
Travel depending upon project needs

Spectrum

New York, New York

Time Warner Cable (TWC) was an American cable television company. Before it was purchased by Charter Communications on May 18, 2016, it was ranked the second largest cable company in the United States by revenue behind only Comcast, operating in 29 states. Its corporate headquarters were located in the Time Warner Center in Midtown Manhattan, New York City, with other corporate offices in Stamford, Connecticut; Charlotte, North Carolina; and Herndon, Virginia. From 1971 to 1981, Time Warner Cable, as Warner Cable, owned Dimension Pictures.

It was controlled by Warner Communications, then by Time Warner. That company spun off the cable operations in March 2009 as part of a larger restructuring. From 2009 to 2016, Time Warner Cable was an entirely independent company, continuing to use the Time Warner name under license from its former parent (including the "Road Runner" name for its Internet service, now Spectrum Internet).

In 2014, the company was the subject of a proposed purchase by Comcast Corporation, valued at $45.2 billion; however, following opposition to the deal by various groups, along with plans by the U.S. government to try to block the merger, Comcast called off the deal in April 2015. On May 26, 2015, Charter Communications announced that it would acquire Time Warner Cable for $78.7 billion, along with Bright House Networks in a separate $10.1 billion deal, pending regulatory approval.

The purchase was completed on May 18, 2016. Charter initially continued to do business as Time Warner Cable in its former markets but has since rebranded these operations under the Spectrum brand in most markets (Charter itself had launched this brand in 2014), though it will continue to use the roadrunner.com and adelphia.net email addresses for new customers.

 
