lenatriestounderstand

Latest

Every note, newest first

The full feed, sorted by the most recent update. Use the filter to narrow by topic, tag, or track — or browse by category →.

34 notes

Econometrics

Endogeneity

Endogeneity — what it is formally, why OLS becomes biased and inconsistent under it, the three main sources (omitted variables, simultaneity, measurement error) plus self-selection, how to detect it in practice, and why this is the central problem the rest of the Econometrics track is built around.

  • endogeneity
  • causal-inference
  • ols
  • ovb
Read
Jun 7, 2026
Econometrics

Instrumental Variables (2SLS)

Instrumental Variables / 2SLS as the foundational tool for causal inference in observational data when the treatment is endogenous — the IV idea in plain terms, the four conditions a valid instrument must satisfy (relevance, exclusion, independence, monotonicity), the two-stage estimator, weak-instrument diagnostics beyond the F>10 rule, what IV actually identifies (LATE, not ATE), and worked instrument choices for pricing problems.

  • iv
  • 2sls
  • causal-inference
  • late
Read
Jun 7, 2026
Econometrics

Regression Discontinuity

Regression discontinuity as a quasi-experimental method built on a threshold-based assignment rule — the continuity assumption that does the identification work, sharp and fuzzy variants, the modern bandwidth and diagnostic toolkit, and why the resulting effect is local to the cutoff rather than population-wide.

  • rdd
  • causal-inference
  • quasi-experiment
  • late
Read
Jun 7, 2026
Econometrics

Control Function Approach

The control function approach as the residual-based alternative to 2SLS — when the two agree, what CF's structural assumption actually is, where an ML-flexible first stage helps and where it does not, and how the idea extends through cross-fitting into double / debiased machine learning.

  • causal-inference
  • iv
  • ml-econometrics
  • double-ml
Read
Jun 7, 2026
Econometrics

Panel Data (Fixed Effects)

Fixed effects for panel data — the within-transformation, what it controls for and what it misses, cluster-robust standard errors, the LSDV equivalence, the staggered-DiD problem, and when FE is the right tool.

  • fixed-effects
  • did
  • panel
  • causal-inference
Read
Jun 7, 2026
Econometrics

Causal ML Beyond Econometrics

Causal ML at the meeting point of econometrics and ML — ATE vs CATE, uplift modelling, DML and orthogonal-moments inference, causal forests, counterfactual prediction, off-policy evaluation, and the standard mistakes from treating predictive models as causal.

  • causal-ml
  • uplift
  • dml
  • cate
Read
Jun 7, 2026
Econometrics

Pricing and Elasticity

Pricing as the worked-example for the Econometrics track — why price is endogenous, the identification strategies (IV, FE, RD, DML, CATE), cross-price elasticity and cannibalization, and what-if analysis with its constant-elasticity caveats.

  • pricing
  • elasticity
  • demand
  • causal-inference
Read
Jun 7, 2026
LLM

Fine-Tuning LLMs: When the Weight Delta Is Worth It

Fine-tuning is not prompt repair. It is a decision to write a reusable parameter delta into an existing checkpoint. That delta changes future logits, defaults, and trade-offs. This note is about when that is worth doing: what fine-tuning actually buys, how to tell whether a gap belongs in the weights, and why data, evals, forgetting, and probability shape matter more than the slogan 'just fine-tune it'.

  • fine-tuning
  • residual-stream
Read
May 30, 2026
LLM

Fine-Tuning LLMs: Post-Training Is a Pipeline, Not a Step

Post-training is not one fine-tuning method. It is a sequence of objective signals. Continued pretraining teaches substrate, SFT teaches examples and defaults, preference optimization teaches comparisons, RLVR teaches verifiable trajectories, and distillation transfers the resulting behavior. The important design question is not which acronym is fashionable, but which stage matches the behavior you are trying to install.

  • fine-tuning
  • post-training
  • sft
  • dpo
  • +3
Read
May 30, 2026
LLM

Fine-Tuning LLMs: Modern Post-Training Deep Dive

A reference-style deep dive into the modern knobs around post-training: preference optimization variants, LoRA and PEFT methods, memory-efficient full fine-tuning, model merging, distillation, tooling, and serving. Read this after the pipeline note, once you know which stage you actually need.

  • fine-tuning
  • dpo
  • lora
  • qlora
  • +4
Read
May 30, 2026
LLM

How LLM Generation Works: Transformer, Sampling, Tokens, Batching, and Validation

What happens inside a transformer when you send a prompt, and how the practical knobs — temperature, max_tokens, structured outputs, batching strategy, retry-with-catch-up — fall out of that picture.

  • llm
  • transformers
  • attention
  • tokenization
Read
May 27, 2026
LLM

LLM Agent Architectures

Agent architecture is where LLM engineering stops being mostly about prompts and starts looking like distributed systems. Covers workflows vs agents, the classical loop, five paradigms (ReAct, Function Calling, Plan-and-Execute, Reflection, CodeAct), MCP as the protocol layer above per-vendor function calling, multi-agent patterns, computer use, memory and resumability, production failure modes including indirect prompt injection, tool security, cost levers, and observability.

  • agent-architectures
  • tool-use
  • mcp
  • multi-agent
Read
May 27, 2026
Embeddings and Retrieval

Chunking Strategies

Chunking turns out to be more architectural than it looks. A walk through what a chunk has to satisfy at six different stages simultaneously, what RecursiveCharacterTextSplitter actually does under the hood, and what the 2024–2026 toolbox — late chunking, contextual retrieval, contextualized chunk embeddings, BGE-M3 multi-functionality, ColBERT late interaction — is actually for.

  • chunking
  • rag
Read
May 26, 2026
Embeddings and Retrieval

How Text Became Geometry

Sixty years of incremental work behind the modern embedding vector. From bag-of-words and BM25, through Word2Vec's distributional hypothesis, to contextual embeddings and contrastive retrieval models. The point isn't trivia — it's that each older idea is still in production today, and the picture only makes sense once you've seen the road that led here.

  • embeddings
  • history
  • retrieval
Read
May 21, 2026
Embeddings and Retrieval

Embeddings: How Geometry Pretends to Be Meaning

Embeddings aren't an encoding of text — they're an attempt to make geometry behave as if it carried meaning. What it means to compress text into a fixed-length vector, how contrastive learning turns statistical structure into distances and directions, why cosine similarity works (and when it stops), how dimension, chunking, context window, and reranking change the physics of a retrieval pipeline, and where embeddings lie usefully.

  • embeddings
Read
May 19, 2026
Time Series

Interpretability and Production Maintenance for Deep Learning Time Series

A practical guide to the post-training lifecycle of deep learning forecasters, covering SHAP-based prediction explanations, lightweight recalibration for drift, and the broader production-maintenance roadmap.

  • interpretability
  • shap
  • model-maintenance
Read
May 14, 2026
LLM

The Hindsight Corpus: Time in LLM Pretraining Data

Saying a model was 'trained on text written before T' invites a picture of human knowledge as of T. The actual corpus is volumetrically skewed toward recent years, dominated by retroactively-edited sources like Wikipedia, missing reliable per-document timestamps, and survivor-biased for older periods. The mechanisms, the failure modes that fall out, what's silently absent from datasheets, and what time-aware pretraining would have to do differently.

  • pretraining
  • training-data
  • temporal
Read
May 13, 2026
LLM

The Missing Now: Temporal Grounding in LLM Agents

A chat transcript preserves order but not elapsed time, world state, or whether earlier hypotheses have expired. For long-running agents, temporal grounding is a runtime problem, not a model problem — what 'now' actually is, the failure modes that fall out when context gets treated as state, the primitives (clocks, event logs, state reducers, expectations, monitors) that close the gap, and how to measure whether it works.

  • agents
  • temporal-grounding
  • state-management
Read
May 13, 2026
LLM

Attention Is All You Need — But Not All Attention Is the Same

Why modern LLMs are no longer just decoder-only transformers with standard multi-head attention. Attention has become a design space — MHA, MQA, GQA, MLA, sliding-window, sparse, linear, recurrent, hybrid — plus position encoding, attention sinks, and KV-cache compression. Each variant solves a different bottleneck.

  • attention
  • kv-cache
  • long-context
Read
May 9, 2026
LLM

Prompt Engineering

What separates a working LLM prompt from a flaky one in 2026 — instruction hierarchy, in-context learning, chain-of-thought, structured outputs, reasoning-model specifics, and the prompt-injection trust boundary.

  • prompt-engineering
Read
May 8, 2026
LLM

The Physics of Hallucination

What hallucination looks like at the level of the transformer's internal computation — distributed representations, signal competition in the residual stream, the softmax bottleneck, the activation-output gap, and the architectural reasons there is no first-class epistemic channel.

  • hallucinations
Read
May 8, 2026
Time Series

Practical Training Recipes for Deep Learning Time Series

A practical checklist for training deep learning time-series models, covering windowing, initialization, embeddings, regularization, optimizers, learning-rate schedules, loss functions, and the Keras training workflow.

  • training
  • regularization
  • optimization
Read
May 6, 2026
Time Series

Classical Statistical Forecasting: ARIMA, SARIMA, and SARIMAX

A practical guide to ARIMA, SARIMA, and SARIMAX: how classical statistical forecasting models handle autocorrelation, differencing, seasonality, external regressors, diagnostics, and where they still fit in modern forecasting.

  • arima
  • sarima
  • sarimax
  • statistical-models
Read
May 5, 2026
Time Series

Deep Learning Architectures for Time Series

An overview of deep learning architectures for time-series forecasting: LSTM, TCN, DeepAR, N-BEATS, and TFT, with a focus on input shapes, local vs global training, covariates, probabilistic outputs, and practical model selection.

  • lstm
  • n-beats
  • tft
  • deepar
Read
May 5, 2026
Time Series

RNN and LSTM for Time-Series Forecasting

How recurrent networks model sequences: vanilla RNN, the vanishing-gradient problem, the LSTM gating mechanism, cell state, and the practical use of return_sequences in Keras.

  • lstm
  • rnn
  • vanishing-gradients
Read
May 5, 2026
Time Series

TCN: Causal and Dilated Convolutions for Time Series Forecasting

A practical explanation of Temporal Convolutional Networks for time-series forecasting: causal convolutions, dilations, receptive field, residual blocks, optional GLU gating, output heads, and when TCN is useful as a standalone or hybrid forecasting component.

  • tcn
Read
May 5, 2026
Time Series

Playing with a Hybrid Architecture for Forecasting

Playing with a hybrid forecasting architecture, assembled out of common neural building blocks: TCN front-end, LSTM recurrence, multi-head attention, TFT-style VSN feature routing with GRN, N-BEATS-style decomposition heads, seasonal and event branches.

  • lstm
  • deep-learning
Read
May 5, 2026
Storage and Streaming

Data Storage Formats

Why CSV, JSON, and Parquet all coexist in real pipelines: file size, read speed, and tool compatibility rarely peak in the same format.

  • csv
  • json
  • parquet
Read
May 4, 2026
Storage and Streaming

Data Streams: Kafka and Protobuf

Kafka as a distributed log for streaming data transport, and Protobuf as a compact binary serialization format — how each works, when to use them, and why they pair well together.

  • kafka
  • protobuf
Read
May 4, 2026
Storage and Streaming

Object Storage

Covers the object storage model — buckets, keys, versioning, lifecycle policies — and Minio as a self-hosted S3-compatible implementation.

  • object-storage
  • s3
  • minio
Read
May 4, 2026
Storage and Streaming

Relational Databases

A practitioner's overview of PostgreSQL: how it handles concurrency (MVCC), its type system, index types, and the Python tooling stack for working with it.

  • relational-databases
  • postgresql
  • sqlalchemy
  • alembic
Read
May 4, 2026
Time Series

Time Series Data: Univariate, Multivariate, Panel, and Exogenous Variables

A practical guide to the main types of time series data and the common confusion between multivariate, panel, and exogenous-variable forecasting.

  • forecasting
  • multivariate
Read
May 3, 2026
Time Series

Time Series Preprocessing: Interpolation, Bucketing, Time Zones, and Missing Intervals

A hands-on explanation of how raw time series data is cleaned, regularized, interpolated, bucketed, and prepared for reliable modeling or visualization.

  • preprocessing
  • interpolation
  • bucketing
Read
May 3, 2026
Time Series

Anomaly Detection: Statistical Baselines

An introduction to statistical baseline methods for time-series anomaly detection: Z-Score, IQR, moving averages, simple ensembles, and evaluation metrics.

  • anomaly-detection
  • z-score"
  • iqr
  • ensemble-methods
Read
May 3, 2026