research.altifigence.com

Altifigence™ Research

Research records on on-device small language models (SLMs), Transformer bottlenecks, and hardware/software optimization.

Current focus

On-device model limits and optimization routes.

The first publication and its notebooks center on decode-stage memory pressure, feed-forward network (FFN) versus KV-cache attention behavior, and a repeatable latency-benchmark methodology.