The Data Science and Big Data course for maritime professionals, prepared by Mr. Venkat Krishna in conjunction with Sea and Beyond, is designed to empower mariners, marine engineers, shipping professionals, and analysts to make informed, auditable decisions using data. The curriculum bridges the gap between traditional maritime expertise and modern technologies by progressing from foundational statistics to advanced AI and machine learning applications relevant to the shipping industry.
A unique feature of this maritime data analytics course is its focus on real-world use cases from the maritime environment. By utilizing raw data from main engine parameters, the program allows maritime professionals to directly relate to the data modelling process. This practical approach is integrated throughout the modules, where students apply advanced algorithms such as Random Forest regression and classification specifically to marine datasets, helping them evaluate predictions and improve operational decision-making.
The course is structured into four primary phases:
The journey begins with the data lifecycle, covering collection from IoT devices, cleaning, and Exploratory Data Analysis (EDA). Participants master essential statistical tools such as hypothesis testing, p-values, Z-scores, and probability distributions (PDF and CDF) to infer population characteristics from sample data.
Learners are introduced to Supervised and Unsupervised learning. The curriculum covers linear and multiple regression, feature engineering, and metrics like RMSE and R² to ensure model reliability. Concepts like degrees of freedom and the Central Limit Theorem (CLT) are emphasized to help students avoid overfitting and ensure their models generalize well to new data.
The course dives deep into specialized techniques applied to maritime datasets, including Clustering methods, What-if analysis, and Anomaly detection using Isolation Forest. These tools are essential for operational tasks like identifying sensor faults or predicting maintenance needs. Additionally, time series analysis and moving averages are applied to marine data to enhance predictive accuracy.
To make maritime data science accessible, the course introduces the KNIME low-code platform, allowing users to apply complex regression and classification models to marine data without intensive coding. Finally, participants learn to build and deploy Streamlit applications and use GitHub to manage their work, transforming their models into practical, day-to-day tools for the maritime industry.
This structured roadmap ensures that maritime professionals can transition from understanding the “3 Vs” of Big Data (Volume, Velocity, Variety) to deploying advanced AI agents, RAG systems, and maritime data analytics solutions that support smarter decision-making in modern shipping operations.
At Sea and Beyond we strive for authenticity and honesty in our work. Our mission is to help “You take a well informed decision” and we try and support you through our various services like
We have a team of writers who conduct thorough research to ensure the accuracy of the content and for the clarity in communication. We will be happy to support you wherever required
₹ 15000
English
17:30:00
1. Course Vision
Foundations
2. Core Concepts
Analytics vs. Analysis
3. Types of Variables
With examples for each of the types of variables
4. Distributions
5. Population vs. Sample
With demonstration using a web application.
(link provided for additional practice)
6. Measures of central tendency
Dispersion
7. Correlation
Demonstration how to calculate Correlations in excel.
8. Causality
Demonstration using web application explaining causality.
(link provided for additional practice)
9. Central Limit Theorem (CLT)
Demonstration using a web application explaining CLT application.
(link provided for additional practice)
Additional note explaining difference between Std. deviation & Std. error.
10. Measures of Central tendency demonstration
Demonstration in a web application explaining skew, kurtosis etc.,
(link provided for additional practice).
1. Data Lifecycle
2. Hypothesis Testing
3 Statistical Tools & Concepts
4. Statistical Tools & Concepts
5. Density functions
PDF (Probability Density Function):
CDF (Cumulative Distribution Function):
6. Z-Test vs T-Test:
7. One tailed & two tailed tests:
8. Errors in Testing
9. Data Issues
10. Visualization Techniques
Module 1: Introduction to Machine Learning
Module 2: Independent & dependent variables
Module 3: Dummy variables
Module 4: Linear regression, Multi linear regression
Module 5: Metrics, Multi linear regression demo
Module 6: Understanding the Metrics
Evaluation of metrics for comparing different models for selecting a better
model for predictions and analytics.
Module 7: Splitting data for ML
Splitting data to training and test data
Module 8: Feature engineering and feature scaling
Module 9: Linear regression applying features
Able to evaluate the model using test data after constructing a model for
predictions.
Module 10: Regression vs Classification
Module 11: ANOVA (One way ANOVA)
Module 1: Introduction to AI ML applications
Module 2: Supervised & Unsupervised learning
Module 3: Random Forest intuition
Module 4: Logistic regression intuition
Module 5: Random Forest regression Use case
Module 6: Random Forest classification Use case
Module 7: Clustering methods
Module 8: Whatif analysis using AI ML
Module 9: Anomaly detection using AI ML
Module 10: Time Series & Moving Averages
Module 11: The Rise of AI
Module 12: AI agents Use case
Module 13: RAG Systems
Module 14: mcp servers and Use case
Module 15: KNIME low code platform
Module 16: Regression models in KNIME
Module 17: Classification models in KNIME
Module 18: Optimization in modelling
Module 19: Streamlit applications and Use case
Module 20: VS code, GitHub
Successful completion of this course will earn you a certificate of completion from Sea and Beyond. This certificate will be emailed to you and you could also share and show it on LinkedIn Please click on the button below to purchase the certificate.