Key Skills Required:
- Programming Skills – Knowledge of statistical programming languages such as R and Python, and of database query languages such as SQL and Hive, is desirable.
- Familiarity with Java or Scala is an added advantage.
- Statistics – Good applied statistical skills, including knowledge of statistical tests, distributions, regression, maximum likelihood estimators, etc.
- Good exposure to Machine Learning and NLP is preferred.
- Strong Math Skills (Multivariable Calculus and Linear Algebra) – Understanding the fundamentals of Multivariable Calculus and Linear Algebra is important, as they form the basis of many predictive performance and algorithm optimization techniques.
- Experience with data visualization tools such as matplotlib, ggplot, d3.js, and Tableau, which help to visually encode data
- Hands-on experience with data science tools
- Analytical mind and great business sense
- Proven experience as a Data Analyst or Data Scientist
- Programming experience in Python, the Django framework, and NumPy/pandas, and an understanding of ML.
- Excellent experience in Python scripting, scraping, analytics pipelines, etc.
- Good understanding of SQL, NoSQL databases such as MongoDB, and ORMs
- Understanding of Linux, plus basic DevOps and deployment skills
- Experience writing REST APIs, working with containers, etc.
- Excellent Communication Skills – It is important to be able to describe findings to both technical and non-technical audiences.
- Exposure to AWS, Snowflake, and Big Data technologies would be an added advantage.