SightX: Preprocessing the Pipeline - ImageNet Math and Augmentation Strategy
Day 8 & 9 They say data is worthless until it is clean. That might be dramatic, but the reality is that 35,000 high-resolution retinal scans sitting in a folder do not mean anything to a neural network until they are resized, normalized, and shaped into the exact format the model expects. This phase was about building the preprocessing pipeline. The bridge between raw JPEG files and tensors ResNet50 can actually train on. Why Preprocessing Is Not Optional The temptation to skip this step is real. The data is downloaded, the model architecture is known, and the urge to just start training is strong. But feeding raw images directly into a pre-trained ResNet50 would fail immediately, not because the model is broken, but because the input format would be completely incompatible with what the pre-trained weights expect. ResNet50 was originally trained on ImageNet: 1.2 million images normalized to specific mean and standard deviation values. If you feed it images with a different...