A new work, entitled Building Legal Datasets, released on Nov 2 from the SMU Centre for Computational Law, emphasised the need for data scientists to recognise that the ‘wild west’ era of ad hoc data gathering is coming to a close, and mirrors the recommendations of a Huawei paper to adopt more stringent habits and methodologies in order to ensure that dataset usage does not expose a project to legal ramifications as the culture changes in time, and as the current global academic activity in the machine learning sector seeks a commercial return on years of investment.