I'm following the getting started guide at [1], but I've removed the MySQL and analytics stuff from the config settings since I do not intend to use any analytics functions. The scdf service however crashes later, saying that no data source is configured.
Ok, so it seems configuring the data source in scdf-config-kafka.yml [2] is still required (though from reading the documentation I assumed it's only for analytic stuff).
But for what? Is the data source used to persist Kafka messages, or establish cloud flow messages between nodes?
I can't find any description about the possible properties for most of the config files. I understand this is a Kubernetes ConfigMap, but that's basically all I could find.
The data source is needed to persist all the stream/task definitions and subsequent batch repositories. Hence, it is not only for analytics.