Data Scientist
Company | Excella |
---|---|
Location | Arlington, VA, USA |
Salary | $Not Provided – $Not Provided |
Type | Full-Time |
Degrees | Bachelor’s |
Experience Level | Mid Level, Senior |
Requirements
- 3+ years in a hands-on role performing advanced predictive analytics using tools like Python, R, or Scala.
- 3+ years writing simple to complex SQL queries to obtain data from multiple source systems.
- 3+ years using data mining methods, such as clustering analysis and anomaly detection, to understand data patterns and select appropriate predictive techniques.
- Experience with applied machine learning (tree-based methods, ensemble methods, neural networks/deep learning)
- Proficient understanding of relational (e.g. Oracle, SQL Server, PostgreSQL) and Big Data distributed structures (Hadoop/Spark) in order to source data effectively.
- Excellent communication skills to be able to interact directly with non-technical client stakeholders and act in a business-to-technical translation role.
- Self-motivated and self-managing.
- Proficient in creating reasonable and accurate time estimates for assigned tasks.
Responsibilities
- Working directly with client stakeholders to understand and define analysis objectives and then translate these into actionable results.
- Obtaining data from multiple, disparate data sources including structured, semi-structured and unstructured data.
- Using machine learning and data mining technique to understand the patterns in large volumes of data, identify relationships detect data anomalies, and classify data sets.
- Working with data integration developers to assess data quality and define data processing business rules for cleansing, aggregation, enhancement etc. support analysis and predictive modeling activities.
- Designing and building algorithms and predictive models using techniques such as linear and logistic regression, support vector machines, ensemble models (random forest and/or gradient boosted trees), neural networks, and clustering techniques.
- Deploying predictive models and integrating them into business processes and applications.
- Validating and optimizing model performance upon deployment and tracking over time as necessary.
- Presenting complex analysis results tailored to different audiences (e.g. technical, manager, executive) in a highly consumable and actionable form including the use of data visualizations.
Preferred Qualifications
- Experience using natural language processing techniques preferred.
- Experience using advanced analytics techniques for fraud detection and prevention preferred.
- Experience building machine learning models for production environment preferred.
- Experience working in an onsite client technical consulting environment preferred.
- Experience working within the Agile Scrum Framework.