News

Costain and Cambridge Uni share dataset for highways digital twins

Image of the data as seen in Unreal Engine 5 - for dataset digital twins story
Integration of prepared CAMHighways data in Unreal Engine 5 with the Chaos Vehicle template. The pavement orthomosaic is draped onto the pavement mesh, and examples of road furniture, such as lamp posts and 3D-generated textured signs, are visible. The vegetation is included in the form of point clouds

Costain and Cambridge University researchers have made freely available a significant dataset that should help kickstart the development of digital twins for roads.

The CAMHighways dataset, drawn from 42.8km of mobile mapping data of the A11, A12 and A14, consists of:

  • textured meshes for road assets (including the pavement, traffic signs, and road furniture);
  • segmented and classified point clouds;
  • orthomosaics generated from pavement images;
  • defect label annotations and shapefiles; and
  • ground penetrating radar (GPR) point clouds.

All the data types are georeferenced to the British National Grid and can be integrated into game engines and/or GIS software.

The researchers outline their work in the March edition of Advanced Engineering Informatics. They state: “The dataset is tailored to build a digital twin and designed to enhance highway inspection and maintenance processes.”

Industry-first dataset

The researchers claim their dataset “distinguishes itself from other datasets in several aspects”. First, they state that existing datasets primarily focus on either autonomous driving or defect detection: the CAMHighways dataset can be used for both at the same time. It can also be used to provide a 3D virtual environment for detailed inspection, simulation and testing.

Second, “while most existing datasets tend to focus on urban environments, CAMHighways exclusively provides data collected on highways”. And third, they claim this is the first such dataset to comprise several modalities integrated together, namely imagery, point clouds and GPR data, as well as corresponding defect and asset labels.

“This provides the academic and industry communities a solid footing for running simulations and training machine learning and AI algorithms, laying the groundwork for future infrastructure digital twins,” they state.

Further development

The researchers acknowledge the limitations of their dataset, not least that most of the roads mapped have a concrete surface, or an asphalt surface layer over a concrete base. This means that the defect and GPR data are mostly representative of concrete road-specific conditions and damage.

Work is ongoing, with some of the researchers working on the collection, integration and 3D reconstruction of thermal data in the dataset.

Don’t miss out on information management and digital construction news: sign up to receive the BIMplus newsletter.

Story for BIM+? Get in touch via email: [email protected]

Leave a comment

Your email address will not be published. Required fields are marked *

Latest articles in News