Cab dataset. Nov 25, 2022 · However, compared to modern documents, the absence of large-scale historical document layout datasets makes the digitalization of ancient books still in its infancy and awaiting excavation and decryption. It contains GPS coordinates of Kaggle is the world’s largest data science community with powerful tools and resources to help you achieve your data science goals. By coupling fully automated dataset generation with environment-grounded conversational evaluation, it offers a new lens through which to measure and improve the real-world capabilities of LLM-based developer tools. This enables continuous, automated expansion across diverse repositories without manual intervention. To this end, this paper proposes a large-scale dataset named SCUT-CAB for layout analysis of ancient Chinese books with complex layouts. Uber & Lyft Cab prices Cab and Weather dataset to predict cab prices against weather Data Card Code (35) Discussion (9) Suggestions (0) About Dataset Context New York City (NYC) Taxi & Limousine Commission (TLC) keeps data from all its cabs, and it is freely available to download from its official website. and R. Sachin Wandre5. Jul 14, 2025 · CAB automatically constructs datasets from GitHub issues tagged as questions, using an LLM-driven pipeline that filters noise, extracts runnable contexts, builds executable containers, and verifies environment correctness. jsonl for evaluation — it contains 274 human-verified, high-quality issues (scored 4+ out of 5 by annotators): The data used in the attached datasets were collected and provided to the NYC Taxi and Limousine Commission (TLC) by technology providers authorized under the Taxicab & Livery Passenger Enhancement Programs (TPEP/LPEP). azyjv toim wbvsf klf xrhkan gsv vexkf pehwdb vmytdk pyzp