Capital One

Data Scientist - Python Standards Implementation

Posted on: 12 Feb 2021

Mclean, VA

Job Description

1750 Tysons (12023), United States of America, McLean, Virginia

Data Scientist - Python Standards Implementation

At Capital One, were building a leading information-based technology company. Still founder-led by Chairman and Chief Executive Officer Richard Fairbank, Capital One is focused on helping our customers succeed by bringing ingenuity, simplicity, and humanity to banking. We measure our efforts by the success our customers enjoy and the advocacy they exhibit. We are succeeding because they are succeeding.

Guided by our shared values, we thrive in an environment where collaboration and openness are valued. We believe that innovation is powered by perspective and that teamwork and respect for each other lead to superior results. We elevate each other and obsess about doing the right thing. Our associates serve with humility and a deep respect for their responsibility in helping our customers achieve their goals and realize their dreams. Together, we are on a quest to change banking for good.

As a Principal Data Scientist in the Retail and Direct Bank, youll be part of a high performing team that is working to define the next generation of banking. The Bank Data Science team has a relentless focus on the craft of modeling, coding, and innovation with a target towards continually improving customer experience and delivering value to the business. Using the latest in machine learning and distributed computing technologies, you will be building the next generation of data products to enable automation and aim for the right decision at the right time for in-the-moment action.

This role is centered around the application of Python and its flexibility across imperative, object-oriented, and functional programming styles. It includes building reusable assets in a Pythonic environment and embodies core principles of The Zen of Python. While an early focus of the role will be on designing and building the model development and execution patterns of the future, there will remain a consistent and ultimately primary intent to establish, educate, and evangelize the best practices required of data scientists to successfully use these platforms with robust and resilient code. The role requires a willingness to teach these principles to other members on the team.

In this role you will:

* Own in-house developed tools & libraries in support of statistical and machine learning model building and deployment to various execution platforms

* New tool and platform discovery and investigation

* Technical documentation in support of playbook(s) standards, FAQs

* Tool adoption, modification, and development to standardize, automate and inner-source best practices for data source access, model development, model promotion to production and model monitoring

* Leverage expertise on platforms and software best practices to enable and improve data scientists code resiliency and performance

* Develop or curate training for software development best practices for data scientist mastery on model build and execution platforms

* Develop code and repo quality standards and train data scientists to adopt and adhere to these standards with structured peer code reviews

* Host office hours or other avenues to assist data scientists in need of assistance on model build and execution platforms and tools

* Engage with the data science community to solicit feedback and lead virtual or in-person training sessions

* Develop and maintain up to date playbooks for the tools and development practices

Basic Qualifications:

* Bachelors Degree plus 5 years of experience in data analytics, or Masters Degree plus 3 years in data analytics, or PhD

* At least 1 year of experience in open source programming languages for large scale data analysis

* At least 1 year of experience with machine learning

* At least 1 year of experience with relational databases

Preferred Qualifications:

* Bachelors or Masters in Computer Science, Computer Engineering, Statistics, or Math plus 3 years of experience in data analytics

* At least 1 year of experience and proficiency in working with AWS (S3, EMR, EC2, IAM, Lambda)

* At least 2 years of experience with containerization (Docker)

* At least 3 years experience in Python

* At least 3 years of experience in PyData software stacks (pandas, numpy, scipy, sklearn, statsmodels)

* At least 3 years experience with machine learning

* At least 3 years experience with SQL

Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.

Capital One

McLean, VA

Capital One Financial Corporation operates as the bank holding company for the Capital One Bank (USA), National Association; and Capital One, National Association, which provides various financial products and services in the United States, the United Kingdom, and Canada. It operates through three segments: Credit Card, Consumer Banking, and Commercial Banking.

The company offers non-interest-bearing and interest-bearing deposits, such as checking accounts, money market deposit accounts, negotiable order of withdrawals, savings deposits, and time deposits. It also provides credit card loans; auto, home, and retail banking loans; and commercial and multifamily real estate, commercial and industrial, and small-ticket commercial real estate loans. In addition, the company offers credit and debit card products; online direct banking services; and treasury management and depository services.

It serves consumers, small businesses, and commercial clients through the Internet and mobile banking, as well as through Cafés, ATMs, and branches located in New York, Louisiana, Texas, Maryland, Virginia, New Jersey, and the District of Columbia. Capital One Financial Corporation was founded in 1988 and is headquartered in McLean, Virginia.

Similar Jobs