Skip to content Skip to sidebar Skip to footer

Beginner’s Guide to Data Extraction with LangExtract and LLMs

Image by Author   #  Introduction   Did you know that a large portion of valuable information still exists in unstructured text? For example, research papers, clinical notes, financial reports, etc. Extracting reliable and structured information from these texts has always been a challenge. LangExtract is an open-source Python library (released by Google) that solves…

Read More

Pandas: Advanced GroupBy Techniques for Complex Aggregations

Image by Author   #  Introduction   While groupby().sum() and groupby().mean() are fine for quick checks, production-level metrics require more robust solutions. Real-world tables often involve multiple keys, time-series data, weights, and various conditions like promotions, returns, or outliers. This means you frequently need to compute totals and rates, rank items within each segment, roll…

Read More