AI Inference - Graph View

The process of running a trained machine learning model to generate predictions, classifications, or outputs from new input data.

Related Concepts: Inference, Machine Learning, Deep Learning, Neural Networks, Model Quantization, Knowledge Distillation, Model Pruning, Speculative Decoding, Edge AI, Fine-Tuning, Transformer, Text Generation, Next-Token Prediction, Model Parameters, Pre-training
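
As a rough illustration of the definition above, the sketch below treats inference as a forward pass through a model whose parameters are already fixed. The weights, bias, and function names are hypothetical stand-ins chosen for the example. They do not come from any real trained model or library.

```python
import math

# Hypothetical parameters, standing in for values learned during training.
# At inference time they are only read, never updated.
WEIGHTS = [0.8, -0.4]
BIAS = 0.1


def predict_proba(features):
    """Forward pass: weighted sum of the inputs followed by a sigmoid."""
    z = sum(w * x for w, x in zip(WEIGHTS, features)) + BIAS
    return 1.0 / (1.0 + math.exp(-z))


def classify(features, threshold=0.5):
    """Turn the predicted probability into a class label (0 or 1)."""
    return int(predict_proba(features) >= threshold)


if __name__ == "__main__":
    new_input = [2.0, 1.0]  # new input data the model has not seen before
    print(predict_proba(new_input), classify(new_input))
```

The same pattern (fixed parameters, new input, computed output) applies to larger models. Techniques listed under Related Concepts, such as quantization and pruning, aim to make this forward pass cheaper.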