Skip to main content

Introduction to Machine Learning

Dates Expected 2028
Location Wageningen Campus
ECTS 1.5

Scope: Machine learning plays an increasingly important role in many scientific areas, including geo-information science and remote sensing, ecology, biosystems engineering, and bioinformatics. Today, scientific data are growing in complexity, size, and resolution, and scientists are challenged to leverage available data to inform decision making. In this course, you will learn how to model patterns and structures contained in data, and evaluate data-driven models, i.e. models that learn directly from observations the phenomena under study. 

Target group: The course is aimed at PhD candidates, postdocs, and other academics that are interested in machine learning applied to environmental data

Prior knowledge: Basic skills in statistics are a plus. Practicals will be in Python. A short introduction will be provided on the first day, but previous programming experience in R or Python is required

Course duration: 5 days

Contact: PE&RC Office:  office.pe@wur.nl 

Registration of interest: You can register your interest HERE (note: this is not an official registration)

 

Room numbers Forum Building01-06-2026 -> B0106
02-06-2026 -> B0106
03-06-2026 -> B0767
04-06-2026 -> B0107
05-06-2026 -> B0106

Machine learning plays an increasingly important role in many scientific areas, including geo-information science and remote sensing, ecology, biosystems engineering, and bioinformatics. Today, scientific data are growing in complexity, size, and resolution, and scientists are challenged to leverage available data to inform decision making. In this course, you will learn how to model patterns and structures contained in data, and evaluate data-driven models, i.e. models that learn directly from observations the phenomena under study. 

The course will focus on the following topics:

  • The machine learning methodology, and framing scientific problems as machine learning tasks
  • Data preparation and representation
  • Key algorithms for regression, classification, and clustering
  • Qualitative and quantitative comparison of characteristics, (dis)advantages, and performance of a number of key algorithms
  • Design and implementation of effective solutions based on chosen algorithms to solve practical problems
     

Through a series of lectures and practical exercises (in Python), the participants will learn about different strategies and their pertinence for specific problems in environmental sciences, but the course will remain general for a broader audience. Participants are encouraged to bring their own problems in class and analyse data from their own research.

Day 1morning: Introduction to machine learning, methodology and best practices,
afternoon: Introduction to Python, Practical on data preparation and representation, cross validation, training/test splits
Day 2morning: lecture on regression methods: linear, LASSO, feature selection, trees, neural networks
afternoon: practical on regression methods  
Day 3morning: lectures on classification methods: Bayesian, kNN, logistic, SVMs, ensembles, forests 
afternoon: practical on classification methods
Day 4morning: lectures on unsupervised analysis: hierarchical, k-means, EM, PCA, t-SNE 
afternoon: practical on unsupervised analysis 
Day 5morning: Bring your own data – Frame your science question as a learning task and work with own data
afternoon: Feedback/ discussion – Outlook on advanced/current topics (i.e. deep learning)  

 

  • Prof. Ricardo da Silva Torres (Artificial Intelligence Group, Wageningen University & Research)
  • Prof. Ioannis Athanasiadis (Artificial Intelligence Group, Wageningen University & Research)