Explore DSLAB’s curated collection of high-impact resources designed to empower learning, teaching, and research in data science. The site includes books, R packages and algorithms, interactive Shiny dashboards, expert-led slides, and hands-on exercises, all crafted to help you deepen your skills and turn data-driven ideas into action.
An overview of regression models, including linear, variable selection, regularization, non-linear transformations, feature engineering, and generalized regression techniques, aimed at data science students.
This book provides practical guidance on using software for statistical data analysis, aimed at helping companies and institutions leverage their data to maintain competitiveness in the digital age.
An exploration of statistical inference, covering key concepts and practical applications essential for data science and engineering students.
Source code is available at GitHub: github.com/URJCDSLab/InferenciaEstadistica.
The sources and PDF version are also published as open-source at BURJC: hdl.handle.net/10115/41304.
This book guides readers through the complexities of data science, from understanding big data and ethical data governance to mastering statistical techniques and using R for practical applications in various fields.
An introduction to the fundamentals of Machine Learning, focusing on equipping students with the skills to extract valuable insights from data and make informed decisions.
Source code is available at GitHub: github.com/URJCDSLab/AprendizajeAutomaticoI.
The sources and PDF version are also published as open-source at BURJC: hdl.handle.net/10115/27431.
This book covers the typical curriculum of Statistics and Probability courses for Science and Engineering degrees at Spanish universities, aiming to provide a comprehensive and accessible resource for both students and a broader audience.
A foundational synthesis of Discrete Mathematics and Linear Algebra as the bedrock for Computer Science and Cybersecurity.
This book presents Data Science as a tool for understanding, preventing, detecting, and addressing cybersecurity threats, aimed at professionals, students, engineers, and mathematicians.
A comprehensive guide that integrates the Six Sigma methodology with the R programming language to improve processes and solve problems using statistical techniques.
ViLT is a comprehensive framework that integrates an intelligent chatbot to support virtual tutoring for students across various degree programs at ETSII, URJC. It leverages large language models, LangChain, and RAG techniques to optimize natural language queries. The platform allows instructors to upload course materials in PDF format, manage user access for teachers and students, and collect feedback to continuously enhance its performance.
Functions and utilities to perform Statistical Analyses in the Six Sigma way.
Implementation of CSViz: Class Separability Visualization for High-Dimensional Datasets.
Read more about it at: doi.org/10.1007/s10489-023-05149-4
Implementation of CPRViz: Consistent Probability Regions Visualization for model understanding.
Implementation of KRAKEN-SND: Knowledge Recovering Architecture based on Keywords Extraction from Narratives for Suspicious News Detection.
Read more about it at: doi.org/10.1016/j.engappai.2021.104230
Implementation and validation of MOE, “Minimally overfitted learners: a general framework for ensemble learning”
Read more about it at: doi.org/10.1016/j.knosys.2022.109669
Generator Python code for the artificial datasets and some results of the Hostility measure from paper “Hostility measure for multi-level study of data complexity”.
Read more about it at: doi.org/10.1007/s10489-022-03793-w
Data used in the paper “Experts perception-based system to detect misinformation in health websites”.
Read more about it at: doi.org/10.1016/j.patrec.2021.11.008
Implementation and evaluation of “Explanation sets: A general framework for machine learning explainability”.
Read more about it at: doi.org/10.1016/j.ins.2022.10.084
An evolution of the Open FAIR™ Risk Analysis Tool, redesigned as an interactive Shiny application to simplify and enhance Cyber Risk Simulation for cybersecurity students and professionals. It allows users to easily generate simulations using customizable statistical distributions, visualize results dynamically, and export simulated data for further analysis in external tools. Tailored for training and educational purposes, it provides a more accessible and hands-on approach to learning cyber risk modeling.
You can view the slides presenting the tool at useR! 2024 in Salzburg at lcano.com/p/user2024.
Interactive app developed so that readers of book “Aprendizaje Automático” can put their knowledge into practice with tasks such as reading and cleaning data, EDA or working with models in an easy and interactive way.
The source code is also published as open-source at BURJC: hdl.handle.net/10115/27461.
Slides from the book “Inferencia Estadística”.
Slides from the book “Aprendizaje Automático”.
Materials for the teaching innovation seminar “Introduction to Git” in the DSLAB-TI series.
Materials for the “Unsupervised Analysis” Short Course.
Materials for the “Exploratory Data Analysis” Short Course.
Laboratories for the book “Inferencia Estadística”, including both the statements and their solutions in multiple readable formats, such as HTML and PDF. Used data and R sources are also included.
The content is also published as open-source at BURJC: hdl.handle.net/10115/41411.
R scripts with exercises for the book “Inferencia Estadística”, including both the problems and their solutions.
The exercises and solutions are also published as open-source at BURJC: hdl.handle.net/10115/41413.
R scripts with exercises for the book “Aprendizaje Automático”, including both the problems and their solutions.
The exercises and solutions are also published as open-source at BURJC: hdl.handle.net/10115/27432.