Skills Projects Experience About Contact
Data Scientist & AI Engineer

Transforming Data into Actionable Insights

Data Scientist with expertise in machine learning, AI, and data engineering. Experienced in deploying practical solutions using LLM models, neural networks, and advanced analytics.

Data Visualization

Technical Skills

Technologies and tools I work with to build innovative solutions

Programming Languages

Python
SQL
R
JavaScript
HTML/CSS

Frameworks & Libraries

TensorFlow
scikit-learn
pandas/numpy
Flask/FastAPI
Apache Spark

Tools & Technologies

Docker
Git/GitHub
PostgreSQL
Apache Kafka
MLflow

Specializations

Machine Learning
Large Language Models
Data Engineering
Geospatial Analysis
MLOps & Deployment

Featured Projects

DEM with Slope Mask and Sentinel-2 RGB

πŸ›°οΈ GeoAI Super-Resolution

Currently developing a deep learning pipeline to enhance spatial resolution of Sentinel-2 satellite imagery for land cover classification, slope detection, and urban growth prediction. Applications include terrain assessment for flooding/landslide risk and real estate development evaluation.

TensorFlow Geospatial AI Satellite Imagery Deep Learning
KDE of Laser Cutt-Off Power Synthetic

🧠 Synthetic Data Generation with LLMs

Bachelor thesis (96% grade) exploring how large language models can generate synthetic data for laser-accelerated ion beam studies. Applied advanced prompt engineering and statistical validation to address data scarcity in specialized physics domains.

LLM Models Prompt Engineering Statistical Analysis Research
Architecture Overview

πŸ“‘ Real-Time IoT Anomaly Detection

Built and deployed a real-time anomaly detection system for IoT sensor data in smart manufacturing. Uses Apache Kafka, Spark Streaming, and Cassandra with Isolation Forest for scalable data processing and automated model drift detection.

Apache Kafka Spark Streaming MLflow Docker
Grafana Dashboard

🌍 Smart City Environmental Monitoring

Real-time data engineering solution for processing environmental sensor data in smart cities. Containerized architecture with threshold breach detection, alerting systems, and Grafana monitoring for proactive municipal response.

Apache Spark Apache Cassandra Grafana Data Engineering
Evaluate Best Model Logistic PCA Predictions Correct Incorrect

🩺 Interpretable ML for Medical Diagnosis

Interpretable machine learning model for breast cancer detection with F1 > 0.95. Features Logistic Regression with PCA, detailed feature analysis, and FastAPI-based Docker deployment with interactive UI for clinical usability.

Logistic Regression Model Interpretability FastAPI Clinical AI
Docker API Hosting

😊 CNN-Based Emotion Recognition

Convolutional Neural Network for real-time emotion classification from facial expressions using MMAFEDB dataset. Features batch normalization, dropout layers, and FastAPI deployment for marketing applications.

TensorFlow CNN Computer Vision FastAPI
AirBnB Database Design

🏘️ SQL Data Mart Design for Airbnb

Designed and implemented a fully functional SQL Data Mart for an Airbnb-style platform using PostgreSQL. Features advanced relational database modeling, 30+ interconnected tables, and optimized queries for analytical reporting.

PostgreSQL Database Design Data Modeling SQL
Clusters by Monthly Theme

πŸ“Š Scientific Trends Analysis with NLP

Unsupervised machine learning analysis of arXiv scientific papers using TF-IDF, PCA, and KMeans clustering. Features scalable processing with Dask Distributed for 4GB+ datasets and interactive topic trend visualizations.

NLP Clustering Dask Distributed Scientific Analysis
Dashboard Bar Hover

πŸ—ΊοΈ Interactive Geospatial Data Visualization

Interactive dashboard for rural population analysis using World Bank data (1960-2022). Features choropleth maps, linked visualizations, and AWS deployment with Dash, Plotly, and Docker for responsive geo-visual storytelling.

Plotly Dash Geospatial Analysis AWS
UML Project Design

πŸ” Automated Web Scraping & Data Wrangling

Automated collection and transformation of data from REST APIs, HTML pages, and WebSocket streams into structured time-series datasets. Features cron-schedulable scrapers with error logging and HDF5 storage.

Web Scraping Beautiful Soup WebSockets Time-Series
Flow Diagram

🧠 Habit Tracker CLI Application

Cross-platform command-line habit tracking app built with OOP principles and SQLite backend. Features habit management, streak analytics, JSON import/export, and comprehensive test coverage with pytest.

Python OOP SQLite CLI Development Testing

Professional Experience

2020 - 2023

Founder

TradeWars

Successfully scaled a cryptocurrency trading simulation platform from concept to 1,000+ active users. Led product development, user acquisition strategies, and technical architecture with focus on growth hacking and market penetration.

  • Achieved 1,000+ user milestone through strategic growth initiatives and viral marketing
  • Architected scalable backend infrastructure handling real-time paper trading executions
  • Executed user acquisition strategies resulting in 300% user base expansion
2017 - Present

Trading Analyst & Community Leader

Trading Channel & Community

Built and scaled a thriving trading community with advanced technical analysis capabilities. Focused on technical analysis, market discussions, cryptocurrencies.

  • Scaled Tradingview following to nearly 4,000 followers through content strategy and engagement
  • Developed popular TradingView scripts with advanced algorithmic trading strategies
  • Featured in Covesting.io's top 7 traders to follow, establishing industry thought leadership
  • Executed content marketing strategy for DueDEX Exchange (2019-2020)

About Me

I'm a Data Scientist with a B.Sc. in Data Science from IU International University of Applied Sciences (final grade 1.5 / GPA 3.7 - top 5% of students). My passion lies in developing AI-driven solutions that solve real-world problems, with extensive experience in machine learning, deep learning, and data engineering.

My work spans generative AI, large language models, anomaly detection, geospatial analysis, and interpretable machine learning. I focus on clean, scalable, and reproducible systems that can move from prototype to production. My thesis on synthetic data generation using LLMs received a 96% grade, showcasing my interest in pushing the boundaries of AI applications.

I'm actively seeking fully remote opportunities focused on AI agents, geospatial intelligence, generative AI, and LLM-powered applications such as RAG-based chatbots. I thrive in agile teams where rapid iteration, curiosity, and innovation are valued.

1.5

Final Grade (German Scale)

3.7

GPA (out of 4.0)

96%

Thesis Grade

10+

Major Projects

Fabian Menne - Data Scientist

Get In Touch

I'm always open to discussing new projects, opportunities, and ideas. Whether you have a question about AI, data science, or just want to say hi, I'll try my best to get back to you!