arxiv LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation