Deep-learning models often fail to generalize across datasets because of differences in data collection and geological properties. Domain adaptation techniques attempt to transfer models between datasets, typically through fine-tuning, but their impact on seismic interpretation is not well studied. In this work, we train eight segmentation models on three fault segmentation datasets (Thebe, synthetic, and CRACKS) to study domain shift. Our results show that domain adaptation in seismic images is closely tied to normalization, as dataset-specific intensity differences strongly affect model performance.

We find that fine-tuning on Thebe degrades a model’s performance under an otherwise identical testing setup. A statistical analysis shows that Thebe has a much lower intensity standard deviation (0.124) than CRACKS (1.149) and synthetic (1.052). In other words, contrast and intensity variation are much lower in Thebe, while CRACKS and synthetic have more diverse intensity distributions. As a result, models trained on Thebe learn from a narrower data distribution and struggle on datasets with wider variation, such as CRACKS and synthetic. Performance metrics confirm this. Models pretrained on CRACKS or synthetic and then fine-tuned on Thebe suffer a sharp drop in Dice score and a large increase in Bidirectional Chamfer Distance (BCD) when tested back on CRACKS. The model forgets features needed for fault segmentation in CRACKS, a classic case of catastrophic forgetting. This suggests that domain shift in seismic segmentation is driven by differences in data distribution rather than by feature learning alone.
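
As a rough illustration of the two metrics mentioned above, the following sketch computes a Dice score and a bidirectional Chamfer distance on binary fault masks. The exact BCD formulation used in this work is not stated here, so the version below (sum of the two directional mean nearest-neighbor distances, in pixels) is an assumption, and the function names `dice_score` and `bidirectional_chamfer` are hypothetical.

```python
import numpy as np


def dice_score(pred, target, eps=1e-8):
    """Dice overlap between two binary masks (1.0 means identical)."""
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    intersection = np.logical_and(pred, target).sum()
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)


def bidirectional_chamfer(pred, target):
    """Assumed BCD: mean nearest-neighbor distance from pred to target
    plus the same quantity from target to pred, measured in pixels."""
    p = np.argwhere(np.asarray(pred, dtype=bool)).astype(float)
    t = np.argwhere(np.asarray(target, dtype=bool)).astype(float)
    if len(p) == 0 or len(t) == 0:
        # Undefined when one mask has no fault pixels; report infinity.
        return float("inf")
    # Pairwise Euclidean distances; fine for small masks, memory-heavy for large ones.
    d = np.linalg.norm(p[:, None, :] - t[None, :, :], axis=-1)
    return d.min(axis=1).mean() + d.min(axis=0).mean()


# Toy example: a vertical "fault" and the same fault shifted by two pixels.
target = np.zeros((8, 8), dtype=bool)
target[:, 3] = True
shifted = np.zeros((8, 8), dtype=bool)
shifted[:, 5] = True

print("Dice (identical):", dice_score(target, target))
print("Dice (shifted):  ", dice_score(shifted, target))
print("BCD  (shifted):  ", bidirectional_chamfer(shifted, target))
```

The toy case shows why the two metrics are complementary: a thin fault shifted by a couple of pixels has almost no overlap, so Dice collapses, while the Chamfer distance still reflects how close the prediction is spatially.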

In seismic data processing, normalization is essential. It helps correct variations in amplitude, signal strength, and noise across datasets. Traditional workflows use gain correction, amplitude scaling, and histogram equalization to make seismic data more consistent. In deep learning, however, dataset-specific intensity differences are often ignored, and models struggle on new datasets as a result. Our findings suggest that proper normalization before training can reduce domain shift. Methods such as global min-max normalization, per-trace standardization, and adaptive histogram equalization could help match dataset distributions and improve model generalization.
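
Two of the normalization schemes named above can be sketched in a few lines of NumPy. This is a minimal sketch under stated assumptions, not the preprocessing used in this study: the function names are hypothetical, the last array axis is assumed to be the sample (time or depth) axis of each trace, and the two volumes are random stand-ins whose standard deviations merely mimic the reported Thebe and CRACKS values.

```python
import numpy as np


def global_min_max(volume):
    """Rescale the whole volume to [0, 1] using its global min and max."""
    v = np.asarray(volume, dtype=np.float64)
    lo, hi = v.min(), v.max()
    if hi == lo:
        # Constant volume: avoid division by zero.
        return np.zeros_like(v)
    return (v - lo) / (hi - lo)


def per_trace_standardize(volume, eps=1e-8):
    """Give every trace zero mean and unit variance.
    Assumption: the last axis holds the samples of a single trace."""
    v = np.asarray(volume, dtype=np.float64)
    mean = v.mean(axis=-1, keepdims=True)
    std = v.std(axis=-1, keepdims=True)
    return (v - mean) / (std + eps)


# Random stand-ins (not real seismic data) with mismatched intensity spread.
rng = np.random.default_rng(0)
low_contrast = rng.normal(0.0, 0.124, size=(16, 16, 64))
high_contrast = rng.normal(0.0, 1.149, size=(16, 16, 64))

for name, vol in [("low-contrast", low_contrast), ("high-contrast", high_contrast)]:
    standardized = per_trace_standardize(vol)
    print(f"{name}: std before = {vol.std():.3f}, after = {standardized.std():.3f}")
```

After per-trace standardization both stand-in volumes have a standard deviation close to 1, which is the kind of distribution matching the paragraph above refers to. Adaptive histogram equalization is not covered by this sketch.
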

In conclusion, our study shows a strong link between dataset statistics and domain adaptation in seismic segmentation. The poor generalization of models trained or fine-tuned on Thebe suggests that normalization differences, and not feature learning alone, contribute to domain shift. Applying seismic data normalization techniques can help reduce domain shift and improve model robustness, leading to better deep-learning models for seismic interpretation.