Workshops & Courses
研讨会与课程
W-3
Workshop
Leveraging Open-Source Python Tools for Predictive Modeling in Aquatic Resource Management
TBA
Organizer(s):
Desmond Ugegeh, UNIVERSITY OF DELTA, AGBOR.
Description
Tropical aquatic ecosystems—from vital fisheries to unique freshwater habitats—are at the forefront of the global conservation crisis. They face mounting pressures from habitat destruction, localized pollution, and the accelerating impacts of climate change. The background to this workshop is the urgent necessity for conservation professionals to move beyond reactive observation and embrace proactive, evidence-based management. This requires leveraging the vast amounts of data being collected into sophisticated forecasts about where risk is highest and where conservation intervention will be most effective. To achieve the conference theme of "Long-Term Resilience," we must democratize access to the tools that make data-driven decisions possible. The scope of this intensive two-hour workshop is to provide participants with a complete, start-to-finish pipeline for predictive modeling using accessible, open-source computational tools. This entire session is built on Python, utilizing Google Colaboratory (Colab), which requires no local software installation, ensuring immediate and seamless participation regardless of technical background. The learning journey is divided into three practical modules: We begin with Geospatial Data Integration, where we use the specialized library GeoPandas to combine tabular environmental data (like water quality) with location coordinates to create compelling, professional thematic maps that visualize conservation hot spots. Next, we delve into the crucial step of preparing this data, often called "Feature Engineering," to structure it correctly for forecasting. The central goal of the workshop is to empower attendees to build their own statistically robust predictive models. Using the popular machine learning library Scikit-learn, participants will train a model (such as a Logistic Regression) designed to predict conservation outcomes, like whether a specific aquatic zone is at High Risk of pollution or decline based on its current environmental variables. By the conclusion, the clear objectives will be met: attendees will know how to manage spatial and tabular data concurrently, execute a predictive model, interpret the model’s results (understanding which factors drive the risk), and ultimately, translate those data insights into practical, actionable policy recommendations for aquatic resource management. This workshop provides the fundamental, reproducible skills needed to lead the next generation of data-driven conservation efforts.
Program Outline
Total Duration: 120 Minutes (2 Hours) 0:00 – 0:15 (15 min): Introduction & Setup Topic: The Data-Driven Conservation Imperative Activity: A brief introduction to the principles of Computational Ecology and the workshop's goals. Participants will be guided to open the Google Colab notebook and set up their environment, which includes loading the required libraries (Pandas, GeoPandas, Matplotlib, Scikit-learn). 0:15 – 0:45 (30 min): Module 1: Geospatial Data Integration & Visualization Topic: Mapping Ecosystem Health with GeoPandas Activity (Hands-on): Participants will load and merge spatial data (e.g., sample locations) with tabular data (e.g., water quality metrics, pollution levels). This module focuses on a practical cartography exercise: creating professional-grade thematic maps to visually identify and display conservation hot spots. 0:45 – 1:15 (30 min): Module 2: Feature Engineering and Predictive Modeling Setup Topic: Preparing Data for Scikit-learn Activity (Hands-on): Introduction to the methods for preparing complex ecological data for machine learning. Participants will learn to select relevant features, handle data scaling/normalization, and split the dataset into training and testing sets. This will be based on a clear case study (e.g., predicting an indicator species). 1:15 – 1:45 (30 min): Module 3: Hands-On Predictive Modeling Topic: Training and Interpreting a Simple Machine Learning Model Activity (Hands-on): Participants will train a predictive model (e.g., Logistic Regression or Random Forest) using Scikit-learn to forecast a specific conservation outcome (such as high/low pollution risk). The focus will be on understanding and evaluating model performance metrics (e.g., accuracy, precision). 1:45 – 2:00 (15 min): Conclusion & Future Directions Topic: From Model to Policy: Discussing Reproducibility Activity (Interactive Discussion): A guided discussion on how to interpret the model's outputs and translate these analytical insights into clear, actionable policy recommendations. This segment will also include a final Q&A session and provide resources for participants wishing to explore more advanced topics (e.g., TensorFlow, deep learning).
Materials that participants need to bring:
As this is a hands-on, browser-based workshop utilizing Google Colaboratory (Colab), attendees must bring the following items:
Laptop or Tablet: A personal computing device with a modern web browser (Chrome, Firefox, Safari) and Wi-Fi capability.
Power Cord/Charger: Required to ensure the device remains functional for the entire two-hour session.
Google Account: Participants must have (or create) a free Google account to access and save their work within the Colab environment.
Notebook and Pen.
(Optional but Recommended): For taking notes on model interpretation and policy application insights. No Pre-installed Software is Required.


