Role purpose:
Assisting in the development and implementation of machine learning solutions to support business objectives, enhance operational efficiency, and improve customer experiences. 
Developing predictive models with large and varied datasets and working collaboratively with a team of professionals across Advanced Machine Learning, technology, data, and customer functions. 
Contributing to the growth and advancement of the Advanced Machine Learning capability across Vodafone globally.

Your responsibilities will include:
Assisting in the development of machine learning models for various business areas using the big data platform.
Writing prototype code (for example, in PySpark) to automate the training and scoring of machine learning models.
Tracking and reporting the performance of machine learning models using tools like Qlik.
Utilizing data visualization techniques to effectively engage stakeholders and communicate insights.
Collaborating with senior data scientists to deliver project milestones to meet business requirements.
Working closely with the Big Data Engineering team to support data ingestion for various use cases.
Collaborating with the Big Data Production Data Engineering team to automate and deploy models into production.

The ideal candidate for the role will have:

Technical / Professional Qualifications:
2-4 years of experience in a similar data science role.
Bachelor's or Master's Degree in quantitative fields such as Mathematics, Statistics, Economics, Computer Science, Engineering, Artificial Intelligence, or related disciplines.
Proficiency in data manipulation, including structured data tools (e.g., SQL) and unstructured data tools and platforms (e.g., Hadoop, Spark, NoSQL).
Familiarity with at least one programming language, such as Python (including PySpark).
Knowledge of machine learning libraries (e.g., scikit-learn, TensorFlow, PyTorch, H2O) and fundamental techniques (e.g., clustering, regression, time-series analysis).
Experience with visualization tools like Tableau or Qlik for data exploration and presentation.
Strong interest in staying updated with the latest advancements and emerging technologies in Machine Learning.
Analytical and expansive thinking with a strong desire to deliver and develop.
Good interpersonal communication and presentation skills.
Ability to work in a fast-paced environment.

Core competencies, knowledge, and experience:
Programming Languages: Proficiency in a programming language such as Python (preferred) or R is essential for data analysis, data manipulation, and building machine learning models.
Statistical Knowledge: Understanding of fundamental statistical concepts and methods is crucial for exploratory data analysis, hypothesis testing, and model evaluation.
Data Manipulation and Cleaning: Ability to preprocess and clean data, handle missing values, and ensure data quality is essential for accurate and reliable analysis. Experience using big data tools such as PySpark, Ray, Dask, and Hive to extract and process data.
Data Visualization: Skills in data visualization libraries like Matplotlib, Seaborn, or ggplot2 to create informative and visually appealing plots and charts for better communication of insights. 
Machine Learning: Basic knowledge of machine learning algorithms, including supervised and unsupervised learning, and the ability to apply them to real-world datasets. Experience in production use cases. Familiarity with the CRISP-DM process model.
Data Analysis Libraries: Familiarity with data manipulation libraries such as Pandas and numerical computing libraries such as NumPy. Exposure to data analysis in Python and to PySpark DataFrames.
SQL: Proficiency in SQL for querying and managing relational databases to extract and process data efficiently. Experience with traditional databases and big data databases (Athena, Hive, Presto).
Data Storytelling: The ability to communicate results effectively, both verbally and visually, to non-technical stakeholders is crucial for a data scientist to make an impact. 

