RAG systems have a recall problem, not a hallucination one2024-10-02 engineering llm machine_learning"As an AI language model, your RAG pipeline sucks and I'm pretty sure there's information you're not giving me..."Read more...
Obsidian Templater Workflow for Chinese Language Learning2024-09-07 taiwan 中文 obsidianPairing Obisian with Taiwan's Open Source dictionary for vocabulary acquisition and practiceRead more...
Semantic Router: GPT-4o API video sampling via semantic chunking2024-05-14 llm semantic_routerA sampling strategy for the GPT-4o API 2-4 frames per second requirementRead more...
Setup a Remote NVIDIA AI Workbench Node using EC22024-04-01 nvidia devopsHow to run GPU-accelerated ML workloads with ease using NVIDIA's AI Workbench and a CUDA-enabled EC2 instance.
Semantic Router: Postprocessing LLM output using Semantic Splitters2024-03-18 llm semantic_routerTrim extraneous LLM output automatically. No regex, no string parsing.Read more...
Semantic Router: Steer local LLMs for decision-making and more2024-01-16 llm semantic_routerUse LLM and agent input semantic meaning as a superfast decision layer for your applicationsRead more...
Formal Grammars for Large Language Models2023-10-05 llmGet useful, structured output from even the smallest of LLMsRead more...
Quick classification of Taiwan's Indigenous Weapons Programmes2023-09-29 taiwan militaryThe unveiling of Taiwan's Hai Kun-class submarine is flexing the country's military-industrial musclesRead more...
Deploying HuggingFace models on NVIDIA-enabled EKS nodes2023-08-31 engineering devopsUse battle-tested container orchestration for your GPU-enabled inference workloads, with baked-in telemetry from PrometheusRead more...