Currently, the examples truncate all documents larger than the context-length supported by the model.
We would need to support inference on documents with any length (doing inference by batch and recombining) if we want to apply it to large corpus such as nemotron-cc.