Dataset | ML Models Creation and Analysis for Reconnaissance - Research Project Details

Project Description

Created a dataset of network ports, services, and associated vulnerabilities for use in the ML models of the MICO framework's proof-of-concept artefact. The dataset consisted of 32,768 rows and 27 columns. I:

  • Cleaned and analysed the data using NumPy and Pandas
  • Implemented and evaluated 7 supervised and unsupervised learning algorithms using scikit-learn
  • Used 6 statistical metrics for model evaluation
  • Trained models to predict target vulnerability scores and classify threats
  • Visualised results using Matplotlib
  • Used Jupyter Notebook and Anaconda for development
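
As an illustration only, the workflow listed above might look like the sketch below. Everything specific in it is an assumption made for the example: the column names (port, service_id, known_vulns, vuln_score, is_threat), the synthetic data, the random forest estimators, and the four metrics shown. The summary above does not name the actual features, algorithms, or metrics used in the project, so this is not a reproduction of that work.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier, RandomForestRegressor
from sklearn.metrics import accuracy_score, f1_score, mean_absolute_error, r2_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in data. Column names and values are hypothetical,
# not taken from the project's dataset.
rng = np.random.default_rng(0)
n = 500
df = pd.DataFrame({
    "port": rng.integers(1, 65536, size=n),
    "service_id": rng.integers(0, 20, size=n),
    "known_vulns": rng.integers(0, 10, size=n).astype(float),
})
# Hypothetical targets: a 0-10 score and a binary threat label derived from it.
df["vuln_score"] = np.clip(df["known_vulns"] + rng.normal(0, 0.5, size=n), 0, 10)
df["is_threat"] = (df["vuln_score"] >= 7.0).astype(int)

# Add some mess to clean: missing values (rows 10 onwards) and 10 duplicate rows.
missing_idx = rng.choice(np.arange(10, n), size=20, replace=False)
df.loc[missing_idx, "known_vulns"] = np.nan
df = pd.concat([df, df.head(10)], ignore_index=True)


def clean(frame: pd.DataFrame) -> pd.DataFrame:
    """Drop exact duplicate rows and fill missing counts with the column median."""
    out = frame.drop_duplicates().copy()
    out["known_vulns"] = out["known_vulns"].fillna(out["known_vulns"].median())
    return out.reset_index(drop=True)


clean_df = clean(df)
features = ["port", "service_id", "known_vulns"]

# Regression: predict the (hypothetical) vulnerability score.
X_train, X_test, y_train, y_test = train_test_split(
    clean_df[features], clean_df["vuln_score"], test_size=0.25, random_state=0
)
reg = RandomForestRegressor(n_estimators=50, random_state=0).fit(X_train, y_train)
reg_pred = reg.predict(X_test)
mae = mean_absolute_error(y_test, reg_pred)
r2 = r2_score(y_test, reg_pred)

# Classification: predict the (hypothetical) threat label, keeping class balance
# the same in both splits via stratify.
Xc_train, Xc_test, yc_train, yc_test = train_test_split(
    clean_df[features], clean_df["is_threat"], test_size=0.25, random_state=0,
    stratify=clean_df["is_threat"],
)
clf = RandomForestClassifier(n_estimators=50, random_state=0).fit(Xc_train, yc_train)
clf_pred = clf.predict(Xc_test)
acc = accuracy_score(yc_test, clf_pred)
f1 = f1_score(yc_test, clf_pred)

print(f"regression: MAE={mae:.3f}, R2={r2:.3f}")
print(f"classification: accuracy={acc:.3f}, F1={f1:.3f}")
```

Holding out a test split before fitting keeps the reported metrics from being measured on the training rows, which is the main design choice worth carrying over to any real dataset.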

Additional Skills Demonstrated

  • Project management: wrote weekly progress reports using MS Word and MS Excel
  • Communication: delivered in-house presentations using MS PowerPoint

The skill set required includes:

  • Technical Skills:
    • Dataset creation and management
    • Data cleaning and analysis (NumPy, Pandas)
    • Machine learning (scikit-learn)
    • Model evaluation (statistical metrics)
    • Data visualisation (Matplotlib)
    • Python programming
    • Jupyter Notebook and Anaconda
  • Soft Skills:
    • Project management and organisation
    • Communication and presentation
    • Time management and self-discipline (weekly progress reports)