We are migrating our application to Kubernetes. We use Apache Camel to process very large files. The problem is that once we run on Kubernetes there will be more than one pod, and there is a high chance that pods will restart often. If the restart happens in the middle of the processing of a huge file,
I acknowledge that many situations can lead to a pod restart in a Kubernetes cluster, but the problems you describe apply to Camel applications running outside Kubernetes as well.
To answer your questions: in case of an application restart in the middle of processing a large file, according to the file component documentation https://github.com/apache/camel/blob/master/camel-core/src/main/docs/file-component.adoc:
That said, there are strategies and EIPs that can be applied, such as the splitter http://camel.apache.org/splitter.html and the idempotent consumer http://camel.apache.org/idempotent-consumer.html, to split the file into chunks and avoid reprocessing chunks that were already handled.
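To make the idea concrete, here is a minimal sketch in plain Java of what "split into chunks and skip the ones already done" amounts to. It is not Camel code and not Camel's API: the class name `ResumableFileProcessor`, the `fileName#lineIndex` key scheme, and the key file are all assumptions made for illustration. In a Camel route the splitter would play the role of the line loop and an idempotent repository would play the role of the key file.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.HashSet;
import java.util.Set;
import java.util.function.Consumer;

/** Hypothetical helper (not part of Camel): processes a file one line at a time
 *  and records each finished chunk on disk so a restarted process can skip it. */
final class ResumableFileProcessor {

    // Assumption: this file lives on storage that survives a pod restart
    // (for example a persistent volume), otherwise the record is lost.
    private final Path processedKeysFile;
    private final Set<String> processedKeys = new HashSet<>();

    ResumableFileProcessor(Path processedKeysFile) {
        this.processedKeysFile = processedKeysFile;
        try {
            // Reload the keys recorded by a previous (possibly interrupted) run.
            if (Files.exists(processedKeysFile)) {
                processedKeys.addAll(
                        Files.readAllLines(processedKeysFile, StandardCharsets.UTF_8));
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Streams the input line by line, hands unseen lines to the handler,
     *  and returns how many lines were handled in this run. */
    int process(Path input, Consumer<String> handler) {
        int handled = 0;
        try (BufferedReader reader =
                     Files.newBufferedReader(input, StandardCharsets.UTF_8)) {
            String line;
            long index = 0;
            while ((line = reader.readLine()) != null) {
                // Illustrative key scheme: file name plus line index.
                String key = input.getFileName() + "#" + index++;
                if (processedKeys.contains(key)) {
                    continue; // finished before the restart, skip it
                }
                handler.accept(line);
                markProcessed(key); // recorded only after the handler succeeded
                handled++;
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return handled;
    }

    private void markProcessed(String key) throws IOException {
        Files.write(processedKeysFile,
                (key + System.lineSeparator()).getBytes(StandardCharsets.UTF_8),
                StandardOpenOption.CREATE, StandardOpenOption.APPEND);
        processedKeys.add(key);
    }
}
```

Because a key is written only after the handler returns, a crash between the two steps means that one chunk is handled again after the restart. The sketch therefore gives at-least-once behaviour per chunk, so the handler itself should tolerate a repeat. With several pods, the key store would also have to be shared between them, which this single-process sketch does not attempt.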