Dataset | ML Models Creation and Analysis for Reconnaissance - Research Project Details
Project Description
Created a dataset of network ports, services, and associated vulnerabilities for use in ML models of the MICO framework proof of concept artefact. The dataset consisted of 32,768 rows and 27 columns. I:
- Cleaned and analysed the data using NumPy and Pandas
- Created and evaluated 7 supervised and unsupervised learning algorithms using scikit-learn
- Utilised 6 various statistical metrics for model evaluation
- Trained models for target vulnerability score predictions and threat classification
- Visualized results using Matplotlib
- Used Jupyter Notebook and Anaconda for development
Additional Skills Demonstrated
- Project management: wrote weekly progress reports using MS Word and MS Excel
- Communication: delivered in-house presentations using MS PowerPoint
The skill set required include
- Technical Skills:
- Dataset creation and management
- Data cleaning and analysis (NumPy, Pandas)
- Machine learning (scikit-learn)
- Model evaluation (statistical metrics)
- Data visualization (Matplotlib)
- Python programming
- Jupyter Notebook and Anaconda
- Soft Skills:
- Project management and organization
- Communication and presentation
- Time management and self-discipline (weekly progress reports)