CACI International Data Scientist in McLean, Virginia
Job Category: Science
Time Type: Full time
Minimum Clearance Required to Start: TS/SCI with Polygraph
Employee Type: Regular
Percentage of Travel Required: None
Type of Travel: None
What You’ll Get to Do:
The Data Scientist (ETL/Java) will be part of a larger team of Software Developers supporting real mission initiatives. The position entails extracting, transforming, and loading raw data into databases using Pentaho, SQL Developer, Java and other tools. Raw data is extracted into database tables and cleansed to make sure information is appropriately categorized. Additionally, the Data Scientist will work with other parts of the team to analyze data, create metrics, and create tools via Machine Learning and Artificial Intelligence that help to more quickly characterize collected data.
More About the Role:
Complete each data set in an efficient and timely manner.
Maintain records of what information was extracted, transformed, and loaded, and how.
Use knowledge of tools for automation or process improvement, where applicable.
Leverage experience to provide support in the areas of data extraction, transformation, and load (ETL) and data mapping.
Provide analytical support, database support, and maintenance support for data exploitation systems.
Provide and support large-scale file manipulation, data modeling, data mapping, data testing, data quality, and documentation preparation.
Work with other teams, program leadership, and key stakeholders as needed.
You’ll Bring These Qualifications:
Experience building software tools to characterize data
Proven expertise and experience with data scripting and manipulation
Development experience in Python
Demonstrated experience performing data ingestion on an enterprise level (structured and unstructured)
Demonstrated ability to analyze, design, build, test, implement and support ETL solutions for multiple subject areas sourced from disparate data sources
Demonstrated experience with basic command line administration in a Linux environment
Demonstrated experience with Oracle development, SQL, and PL/SQL
Demonstrated experience with software configuration management utilizing COTS tools (e.g., Subversion)
Demonstrated experience with design mappings for Data Capture, Staging, Cleansing, Transforming, Loading and Auditing
Must have active TS/SCI with Polygraph
These Qualifications Would be Nice to Have:
Experience in data modeling in a large Enterprise class database environment
Experience with Perl, Pentaho, and Shell Scripting
Experience with Data Security policies
Experience with Microsoft SQL Server
Familiarity with Hadoop, MarkLogic, HBase, or similar technologies
Ability to analyze, design, build, test, implement and support ETL solutions for multiple subject areas sourced from disparate data sources
Ability to develop ETL design documentation including sourc
What We Can Offer You:
We’ve been named a Best Place to Work by the Washington Post.
Our employees value the flexibility at CACI that allows them to balance quality work and their personal lives.
We offer competitive benefits and learning and development opportunities.
We are mission-oriented and ever vigilant in aligning our solutions with the nation’s highest priorities.
For over 55 years, the principles of CACI’s unique, character-based culture have been the driving force behind our success.
Company Overview: At CACI, you will have the opportunity to make an immediate impact by providing information solutions and services in support of national security missions and government transformation for Intelligence, Defense, and Federal Civilian customers. CACI is an Equal Opportunity Employer – Females/Minorities/Protected Veterans/Individuals with Disabilities.
As directed by Executive Order 14042, all current and newly hired employees are required to be fully vaccinated for COVID-19 by January 18, 2022 and provide proof of vaccination, except where they are legally entitled to an exemption/accommodation.