The research team developed a survey instrument to gather input from state agencies on their use of Machine Learning (ML) methods and applications. The survey questions were created to capture as much information as possible from each transportation agency not only on their current use of ML but also on their plans for increasing the adoption of ML in their applications. The questions also attempted to capture the challenges, obstacles, or limitations some agencies may have experienced in the development of current applications.
A web-based survey was launched by Old Dominion University (ODU) on July 18, 2022, using the Qualtrics platform, and email invitations were sent out to a large number of contacts at state DOTs. Several reminders were sent to the survey participants to increase the response rate. The survey remained open until September 16, 2022, to allow for participation from as many states as possible. This Chapter documents the results and findings of the survey. The questionnaire used for the survey is provided in Appendix A. The questionnaire includes three main sections as follows:
Part A: Introduction/Familiarity with ML Methods
Part B: ML Applications Developed/Used In-House or Deployed as Commercial Products
Part C: Ongoing and Future Development of ML Applications
If the respondents indicate that their agency is using ML in question A6, they complete Part B. Otherwise, they move directly to Part C of the survey. The findings from the survey are summarized below by providing statistics and/or comments received for each one of the survey questions.
A1. What state is your agency located in?
The research team received 43 survey responses from 29 states (Figure 6) and more than one response was received from a few large states.
A2. Which one of the following best describes your agency/organization?
Of the 43 responses received from different agencies, 42 of them identified with state DOTs, and one response was received from a County Transportation Authority as shown in Table 3.
Table 3 Responses to the survey by agency/organization type.
| Agency/Organization | Number | % |
|---|---|---|
| State Department of Transportation (DOT) | 42 | 97.7% |
| Other (County Transportation Authority) | 1 | 2.3% |
A3. Within your agency, please select an option that best describes the departmental unit you work for.
Participants were asked to identify the department unit they represent. Table 4 shows the broad functional representation of the units that participated in the survey.
Table 4 Responses to the survey by the primary function of the department unit.
| Within your agency, please select an option that best describes the departmental unit you work for. | Number | % |
|---|---|---|
| Central or district administration, finance/fiscal, human resources, public relations, procurement | 2 | 4.7% |
| Research and Development | 6 | 14.0% |
| Engineering – traffic operations, transportation systems operations and management | 10 | 23.3% |
| Engineering – maintenance, asset management | 5 | 11.6% |
| Engineering – construction, rehabilitation, materials | 3 | 7.0% |
| Mobility planning, demand management, tolling, land use | 1 | 2.3% |
| Public transportation, alternative modes of mobility | 1 | 2.3% |
| Information technology, information security | 3 | 7.0% |
| Other | 12 | 27.9% |
A4. How familiar is your agency with ML methods and tools (examples include artificial neural networks, deep learning, decision trees, support vector machines, K-Nearest Neighbors (KNN), K-means, random forest, logistic regression, gradient boosting methods, reinforcement learning)?
When asked about familiarity with ML methods and tools, 23 respondents indicated that they were not very familiar with ML methods and tools (see Table 5). 14 respondents indicated that they are either very familiar or somewhat familiar with ML methods and tools. This shows that only one out of three respondents had some knowledge in this area.
Table 5 Familiarity of respondents with ML tools and methods.
| How familiar is your agency with ML methods and tools? | Number | % |
|---|---|---|
| Very familiar (we have developed or used several ML applications) | 1 | 2.3% |
| Somewhat familiar (we have ML projects in the early stage of development) | 13 | 30.2% |
| Not very familiar (we are aware of ML but have not started actively exploring them) | 23 | 53.5% |
| Not at all familiar (we are not aware of any ML methods or tools that are applicable to our agency operations) | 6 | 14.0% |
A5. Does your agency have Data Scientist, Data Engineer, or Data Analyst employees? How many Data Scientists, Data Engineers, or Data Analysts does your agency employ? Please enter the number of people employed by your agency in these positions.
Respondents were also asked if their agency employed Data Scientists, Data Engineers, or Data Analysts, as shown in Table 6. The number of data scientists, data engineers, or data analysts employed by the agency ranged from 1 to 264. Additionally, less than half of respondents indicated they had positions with their agency that provided data scientist-type services or had similar skill sets. The number of people employed in these roles ranged from 1 to 10. The results indicate some disparity in the needs for skills in data science from one agency to another, depending on the state and the size of the agency.
Table 6 Data science competencies within the agency.
| Does your agency have Data Scientist, Data Engineer, or Data Analyst employees? | Number | % |
|---|---|---|
| Yes | 25 | 58.1% |
| No | 18 | 41.9% |
| Does your agency have any positions that provide data scientist-type services or have similar skill sets within your agency? | Number | % |
| Yes | 8 | 44.4% |
| No | 10 | 55.6% |
A5. Does your agency have any ML applications (models or tools which have been shared for use by agency staff) currently deployed and/or being developed?
19 respondents indicated that their agency had ML applications currently deployed and/or being developed as shown in Table 7. Those who responded with a ‘yes’ to this question came from 15 distinct
states out of the 29 states that responded to the survey. This shows that about half of the participating agencies have adopted or are about to adopt ML in some of their applications.
Table 7 Currently deployed or under development ML applications.
| Does your agency have any ML applications currently deployed and/or being developed? | Number | % |
|---|---|---|
| Yes | 19 | 44.2% |
| No | 24 | 55.8% |
A5. What ML methods has your agency used or are in the development phase for one or more applications?
When asked about the most identified ML method used by their agencies or those that are in the development phase for one or more applications, the responses identified Artificial Neural Networks (ANN) and Deep Learning (DL) as the most common category as shown in Table 8. One respondent indicated that their agency was using/developing support vector machines (SVM).
Table 8 ML Methods adopted or being developed for applications.
| What ML methods has your agency used or are in the development phase for one or more applications? | Number | % |
|---|---|---|
| Artificial Neural Networks (ANN), Deep Learning (DL) | 13 | 30.2% |
| Decision trees, random forests | 3 | 7.0% |
| Support Vector Machines (SVM) | 1 | 2.3% |
| KNN, K-means | 2 | 4.7% |
| Logistic regression | 4 | 9.3% |
| Reinforcement learning methods | 4 | 9.3% |
| Unsupervised learning (e.g., clustering) | 4 | 9.3% |
| Others | 2 | 4.7% |
Respondents were asked to identify for which application areas their agency developed, implemented, or procured ML solutions.
B1 For which of the following application areas has your agency developed/implemented/procured ML solutions?
Table 9 shows that more than one-quarter of respondents identified asset management and maintenance identified transportation systems management and operations (TSMO). One respondent indicated transportation planning demand forecasting, and land use.
Table 9 Application areas with ML solutions adopted by the agency.
| For which of the following application areas has your agency developed/implemented/procured ML solutions? | Number | % |
|---|---|---|
| Transportation systems management and operations | 7 | 16.3% |
| Commercial vehicle freight operations | 2 | 4.7% |
| Asset Management and maintenance | 11 | 25.6% |
| Construction, rehabilitation, materials | 3 | 7.0% |
| Transportation planning demand forecasting, land use | 1 | 2.3% |
| Intersection or road safety improvement | 4 | 9.3% |
| Other | 7 | 16.3% |
B2 Which in-house ML applications (including applications developed that leverage available open-source code such as pre-trained neural networks) is your agency currently using? Please indicate the applications that your organization uses the most often and indicate whether you utilize open-source code.
When asked to identify in-house ML applications being used by the agency, the respondents provided a broad range of responses as listed below.
B3 What commercial ML products (e.g., video analytics programs for traffic volumes, algorithms for detecting incidents, image-based methods for detecting pavement or road conditions) is your agency currently using?
When asked to name commercial ML products adopted by the agency, the respondents provided the list below.
B4 Other than the ML applications currently being used, what additional ML applications are currently being explored or are under consideration for future implementation by your agency? List as many as possible and their application areas.
When asked to name additional ML applications being explored for future implementation by the agency, the respondents provided the list below.
B5 How satisfied is your agency with the ML applications currently in use or development? (e.g., in terms of meeting original objectives, producing implementable results, etc.)
Based on the responses received, Table 10 shows that satisfaction within respondents’ agencies for ML applications currently in use or development was high with 12 respondents reporting being somewhat satisfied and another 1 respondent being very satisfied. 1 respondent reported their agency is not at all satisfied with ML applications currently in use or development.
Table 10 Level of satisfaction with ML applications in use.
| How satisfied is your agency with the ML applications currently in use or development? | Number | % |
|---|---|---|
| Very satisfied | 1 | 5.9% |
| Somewhat satisfied | 12 | 70.6% |
| Not very satisfied | 3 | 17.6% |
| Not at all satisfied | 1 | 5.9% |
B6 Please name one of the most widely adopted ML applications that your agency currently uses most often
Respondents were asked to name one of the most widely adopted ML applications that their agency currently uses most often and answer a series of questions about that application. Names/descriptions of these widely adopted applications included:
Of those who answered questions about the most widely adopted ML application, 10 respondents characterized that application as R&D (application is in the research and development phase), while 5 respondents characterized the application as Production (application is in production and has been implemented in the field) as shown in Table 11. 11 percent indicated Prototype (the application has been tested and is now a prototype).
Table 11 Level of maturity of the ML applications.
| Which one of the following would best characterize the maturity level of the ML application you indicated on the previous page? | Number | % |
|---|---|---|
| R&D: Application is in the research and development stage | 10 | 55.6% |
| Prototype: The application has been tested and is now a prototype | 2 | 11.1% |
| Production: Application is in production and has been implemented in the field | 5 | 27.8% |
| Other | 1 | 5.6% |
When asked about the type of input data used for the ML applications in use, nine respondents identified image data (off-line analysis), while 3 respondents indicated real-time video or real-time sensor data as shown in Table 12. 6 respondents selected “Other” for the type of input data and listed the types below.
Table 12 Type of input data for ML applications.
| What is the type of input data for the ML application? | Number | % |
|---|---|---|
| Real-time video or imagery | 3 | 7.0% |
| Image data (off-line analysis) | 9 | 20.9% |
| Real-time sensor data | 3 | 7.0% |
| Other | 6 | 14.0% |
When asked about the period the agency needed to develop and implement the ML application, the majority of respondents indicated that it took two years or less for their agency to fully implement/develop this application, with eight respondents responding 1-2 years and another 3 respondents indicating it took less than one year as shown in Table 13.
Table 13 Development/implementation period for ML application.
| How long did it take for your agency to fully implement/develop this application? | Number | % |
|---|---|---|
| Less than 1 year | 3 | 23.1% |
| 1-2 years | 8 | 61.5% |
| 3-4 years | 1 | 7.7% |
| More than 4 years | 1 | 7.7% |
When asked how long the agency has been using this ML application in practice, nine respondents reported that their agency has been using the application for less than one year, while 3 respondents have been using the ML application for 3-4 years and two respondents have been using it for 1-2 years as shown in Table 14.
Table 14 Years ML application has been used in practice.
| How long has your agency been using the ML application (in practice for supporting daily operations)? | Number | % |
|---|---|---|
| Less than 1 year | 9 | 64.3% |
| 1-2 years | 2 | 14.3% |
| 3-4 years | 3 | 21.4% |
When asked about the most typical users of the ML application, 16 respondents selected those within the agency/in-house, while 2 respondents selected other state agencies. 1 respondent selected private industry and two respondents selected others as shown in Table 15.
Table 15 Typical ML application users.
| The typical users of this application are…? | Number | % |
|---|---|---|
| Those within my agency/in-house | 16 | 37.2% |
| Other state agencies | 2 | 4.7% |
| Private industry | 1 | 2.3% |
| Others | 2 | 4.7% |
Respondents were asked to indicate the implementation scale of the ML application. 7 respondents indicated the ML application is a single site. Additionally, three respondents indicated the ML application is used regionally and six respondents indicated it is used statewide as shown in Table 16.
Table 16 Implementation scale of ML application.
| Please indicate the implementation scale of this ML application | Number | % |
|---|---|---|
| Statewide | 6 | 33.3% |
| Regional | 3 | 16.7% |
| Single Site | 7 | 38.9% |
| Other | 2 | 11.1% |
When asked to provide an estimate of the annual operating cost of the ML application, 10 respondents reported that the cost was less than $50,000. Very few reported the annual cost to exceed $300K as shown in Table 17.
Table 17 Estimated annual operating cost of ML application.
| What is the estimated annual operating cost for this ML application? | Number | % |
|---|---|---|
| <= $50K | 10 | 66.7% |
| More than $50K to $100K | 1 | 6.7% |
| More than $100K to $200K | 1 | 6.7% |
| More than $200K to $300K | 0 | 0.0% |
| More than $300K to $400K | 2 | 13.3% |
| More than $400K | 1 | 6.7% |
The survey participants were also asked to identify the main benefits of adopting ML methods for this particular application and their responses varied widely. Below is a summary of the benefits identified by the respondents.
Those respondents who indicated that their agency did not have any ML applications currently deployed and/or being developed were asked what ML applications are currently being explored or are under
consideration for future implementation by their agency. Examples of ML applications that are currently being explored/under consideration for future implementation include:
As shown in Table 18, the most common response regarding the agency’s motivation for considering/using ML methods for developing current/future ML applications was the effectiveness of ML methods compared to traditional methods. Further, 10 respondents indicated the main motivation was processing large amounts of data, and 10 respondents indicated reduction in labor costs.
Table 18 Motivation for the adoption of ML methods/solutions.
| What is your agency’s main motivation for considering/using ML methods for developing current and/or future applications? | Number | % |
|---|---|---|
| Processing large amounts of data | 10 | 23.3% |
| Reduction in labor cost | 10 | 23.3% |
| Effectiveness of ML methods compared to traditional methods | 20 | 46.5% |
| Other | 2 | 4.7% |
All respondents were asked what the top three challenges their agency foresees in the development and adoption of future ML applications. As shown in Table 19, the two most commonly reported challenges were a lack of AI/ML skilled workforce (27 respondents) and integrating with existing processes and systems (24 respondents). Respondents also indicated concerns with a lack of dedicated funding (12 respondents), maturity of ML technology (11 respondents), lack of other resources (9 respondents), and data availability (8 respondents).
Table 19 Current and future challenges in the development and adoption of ML applications.
| What challenges does your agency foresee in the development and adoption of future ML applications? | Number | % |
|---|---|---|
| Lack of AI/ML skilled workforce | 27 | 62.8% |
| Stakeholder perception | 4 | 9.3% |
| Data availability | 8 | 18.6% |
| Cost | 7 | 16.3% |
| Lack of dedicated funding | 12 | 27.9% |
| Lack of other resources | 9 | 20.9% |
| Safeguarding the privacy and security of sensitive data | 3 | 7.0% |
| Computational resources | 4 | 9.3% |
| Trustworthiness | 6 | 14.0% |
| Equity and ethical issues | 1 | 2.3% |
| Maturity of ML technology | 11 | 25.6% |
| Integrating with existing processes and systems | 24 | 55.8% |
| Other | 1 | 2.3% |
Many respondents indicated that their agency does not have a vision for adopting ML methods and applications in the near future nor do they have a roadmap for ML adoption.
Below is a summary of the key findings from the survey. In the spirit of employing ML in practice for efficiency, the data collected from the surveys are fed to OpenAI’s ChatGPT, and it is then prompted to summarize the survey results in three to four paragraphs. The bulk of the text below is generated by ChatGPT-4 but a few corrections and revisions were made to improve accuracy and readability.
The survey responses provide a comprehensive overview of the current state of machine learning (ML) adoption and familiarity across 29 different state DOTs in the United States. The survey reveals a varied level of familiarity with ML methods and tools among these agencies. Over half of the respondents admit to being not very familiar with ML, indicating a gap in knowledge and application of these technologies within the sector. Despite this, there is a notable interest in developing and deploying ML applications, with 13 agencies reporting current development or deployment efforts. The employment of data scientists, engineers, or analysts in over half of the agencies suggests a growing recognition of the importance of data science competencies in leveraging ML technologies.
The survey also sheds light on the specific application areas and types of ML methods being explored or utilized by transportation agencies. Asset management and maintenance, along with transportation systems management and operations, emerge as primary areas where ML solutions are being adopted. The use of artificial neural networks and deep learning is particularly prevalent, reflecting the trend toward more sophisticated ML techniques to address complex transportation challenges. Despite the enthusiasm for ML applications, the survey indicates a cautious approach, with many applications still in the research and development or prototype stages. This is further reflected in the satisfaction levels with current ML applications, where the majority of respondents report being somewhat satisfied, suggesting room for improvement in meeting agencies’ objectives and expectations.
Looking toward the future, the survey highlights several challenges and motivations driving the adoption of ML in transportation agencies. The effectiveness of ML methods compared to traditional approaches and the potential for processing large amounts of data and reducing labor costs are significant motivators. However, the lack of an AI/ML skilled workforce and challenges integrating ML with existing processes and systems are seen as major barriers to further adoption. This indicates a critical need for skill development and organizational change to fully leverage ML technologies. Despite these challenges, the interest in exploring new ML applications, such as automated incident detection and predictive models for mobility applications, suggests a forward-looking stance among transportation agencies, albeit tempered by a realistic assessment of the hurdles to widespread ML implementation.