Add experimental Kafka/Avro source #644

@yruslan

Description

Background

Spark Structured Streaming can be used to read data from Kafka when records are in Avro format. This would make it possible to maintain proper checkpoints and achieve exactly-once semantics.
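As a rough illustration, a minimal Structured Streaming pipeline that reads Avro records from Kafka could look like the sketch below. The broker address, topic name, schema, and paths are placeholders, and the spark-avro module is assumed to be on the classpath:

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.avro.functions.from_avro
import org.apache.spark.sql.functions.col

object KafkaAvroStreamSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("kafka-avro-source").getOrCreate()

    // Avro schema of the record value (hypothetical; in practice it would
    // come from configuration or a schema registry).
    val avroSchema =
      """{"type": "record", "name": "MyRecord",
        | "fields": [{"name": "id", "type": "long"},
        |            {"name": "name", "type": "string"}]}""".stripMargin

    // Read the topic as a streaming DataFrame.
    val df = spark.readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "broker:9092")
      .option("subscribe", "my-topic")
      .option("startingOffsets", "earliest")
      .load()

    // Kafka delivers the payload as binary; decode the 'value' column with
    // from_avro() from the spark-avro module.
    val decoded = df
      .select(from_avro(col("value"), avroSchema).as("data"))
      .select("data.*")

    // A checkpoint location lets Spark track consumed offsets, which is what
    // enables exactly-once semantics for supported sinks.
    val query = decoded.writeStream
      .format("parquet")
      .option("path", "/data/output/my-topic")
      .option("checkpointLocation", "/data/checkpoints/my-topic")
      .start()

    query.awaitTermination()
  }
}
```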

Feature

Add experimental Kafka/Avro source.

Example [Optional]

--

Proposed Solution [Optional]

Ideas so far:

  • Use foreachBatch(). We need to make sure checkpoints are updated.
    🛑 This doesn't work because you can't convert a streaming DataFrame to a batch DataFrame. You can only provide a function that processes batch DataFrames (see the first sketch after this list). Implementing this in Pramen would require interface changes across incremental sources.
  • Use Kafka batch reads and save offsets in the offset bookkeeping table (see the second sketch after this list).
  • If checkpoints are not updated, data can be saved to a temporary location with batch ID generation.
  • Abstract away streaming ingestion as a separate job that performs the full source+save logic, and do not split it.
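To make the first idea and its limitation concrete, here is a minimal sketch (broker, topic, and paths are placeholders, and the parquet write merely stands in for Pramen's save logic). foreachBatch() only ever hands batch DataFrames to a callback, so a source interface that must return a batch DataFrame to the framework can't be built on top of it without inverting control:

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.streaming.Trigger

object ForeachBatchSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("foreach-batch-sketch").getOrCreate()

    val stream = spark.readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "broker:9092")
      .option("subscribe", "my-topic")
      .load()

    val query = stream.writeStream
      // Process whatever is available, then stop: close to batch-job behavior.
      .trigger(Trigger.Once())
      .option("checkpointLocation", "/data/checkpoints/my-topic")
      // The callback receives each micro-batch as a regular batch DataFrame.
      // Control is inverted: Spark calls us with the DataFrame, so a source
      // that must *return* a batch DataFrame to its caller cannot be
      // implemented this way without interface changes.
      .foreachBatch { (batchDf: DataFrame, batchId: Long) =>
        batchDf.write.mode("append").parquet(s"/data/output/batch_$batchId")
      }
      .start()

    query.awaitTermination()
  }
}
```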
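For the second idea, a sketch of a Kafka batch read driven by offsets from a bookkeeping table. The offset values, broker, and topic are hypothetical, and the bookkeeping read/write itself is Pramen-specific, so it is only hinted at here:

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{col, max}

object KafkaBatchOffsetsSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("kafka-batch-offsets").getOrCreate()

    // Start offsets loaded from the bookkeeping table (hypothetical values).
    // The JSON gives the first offset to read per partition (inclusive), so a
    // real implementation would store last-read offset + 1.
    val startOffsets = """{"my-topic": {"0": 1500, "1": 1432}}"""

    // A batch (non-streaming) read of the topic up to the latest offsets.
    val df = spark.read
      .format("kafka")
      .option("kafka.bootstrap.servers", "broker:9092")
      .option("subscribe", "my-topic")
      .option("startingOffsets", startOffsets)
      .option("endingOffsets", "latest")
      .load()

    // ... decode and save the data here ...

    // After a successful save, record the next start offset per partition;
    // this replaces Spark's streaming checkpoints. Partitions with no new
    // data would keep their previously stored offsets.
    val nextOffsets = df
      .groupBy(col("partition"))
      .agg((max(col("offset")) + 1).as("nextOffset"))
      .collect()

    // Persisting to the bookkeeping table is Pramen-specific and omitted.
    nextOffsets.foreach(r => println(s"partition=${r.get(0)} nextOffset=${r.get(1)}"))
  }
}
```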

Metadata

Assignees: No one assigned
Labels: enhancement (New feature or request)
Type: No type
Projects: No projects
Milestone: No milestone
Relationships: None yet
Development: No branches or pull requests
