Gaussian Processes from Scratch. Gain a deeper understanding of Gaussian… | by Theo Wolf | Jan, 2024

Gain a deeper understanding of Gaussian processes by implementing them with only NumPy. Gaussian Processes (GPs) are an incredible class of models. There are very few Machine Learning algorithms that give you an accurate measure of uncertainty for free while still being super flexible. The problem is, GPs are conceptually really difficult to understand. Most…

Generating Synthetic Descriptive Data in PySpark | by Matt Collins | Jan, 2024

Use various data source types to quickly generate text data for artificial datasets. Image generated with DALL-E 3. In a previous article, we explored creating many-to-one relationships between columns in a synthetic PySpark DataFrame. This DataFrame only consisted of Foreign Key information and we didn’t produce any textual information that might be useful in a demo…

How Artificial Intelligence Might be Worsening the Reproducibility Crisis in Science and Technology | by LucianoSphere (Luciano Abriata, PhD) | Jan, 2024

Discussion backed up by some concrete examples, sketching broad guidelines on how to develop better AI systems. Photo by National Cancer Institute on Unsplash. Artificial Intelligence has become an integral tool in scientific research, but concerns are growing that the misuse of these powerful tools is leading to a reproducibility crisis in science and its technological…
