Blog

Quantizing Whisper Base to INT8 and Deploying It on iOS Without a Framework

whisper, onnx, quantization, ios, flutter, on-device-ml, asr

How I exported Whisper Base from Hugging Face, quantized it to INT8 with onnxruntime.quantization, and built the full inference pipeline in Dart — mel spectrogram, tokenizer, autoregressive decoder, token streaming — without sherpa-onnx or any abstraction layer.


ONNX: The Universal Runtime That Makes Edge AI Real

onnx, edge-ai, deployment, onnxruntime, quantization, inference-optimization

A deep dive into ONNX — the open model format and its runtime ecosystem that decouple training from deployment, so the same exported model can run on CPUs, GPUs, mobile chips, and browsers without rewriting a line of inference code.
