Model Pruning - Graph View A neural network compression technique that removes redundant or low-impact weights, neurons, or entire layers to create smaller, faster models. View concept details Related ConceptsAI Inference Model Quantization Knowledge Distillation Deep Learning Neural Networks Edge AI ← Back to full graph