it should cache endpoint response if needed if should also ml endpoint response - or used redis 8 RC as chat history db ?