How effective are AI tools for assessing body composition on CT?

Kate Madden Yee, Senior Editor, AuntMinnie.com. Headshot
2023 02 23 00 33 2026 2023 02 22 Ajr Pooler Image 20230223021627

Artificial intelligence (AI) tools for assessing body composition on CT appear to produce highly accurate technical results, according to research published February 22 in the American Journal of Roentgenology.

Dr. B. Dustin Pooler.Dr. B. Dustin Pooler.

The findings could translate into better patient care, wrote a team led by Dr. B. Dustin Pooler of the University of Wisconsin in Madison.

"Given that over 70 million CT examinations are performed annually in the United States, these tools have tremendous potential to be used for opportunistic health screening on abdominal CT examinations obtained for nearly any indication," the team noted.

A concern about AI assessment tools is whether their performance is consistent across imaging studies conducted at various facilities and equipment types, Pooler and colleagues wrote. To address this concern, the investigators tested the technical performance of a set of three automated tools for AI analysis of abdominal body composition (bone attenuation, muscle composition and attenuation, and amounts of visceral and subcutaneous fat) on a set of CT exams taken at different institutions and on different scanners.

Their study included 8,949 patients who underwent 11,699 abdominal CT exams; the exams were conducted at 777 different institutions on 82 scanners from six different manufacturers. Pooler and colleagues evaluated one axial series of images per exam for the three AI tools' performance in assessing body composition. It defined "technical adequacy" as tool output values within established reference ranges and "failure" as values outside of the reference range.

Axial images at L1 level, without (left) and with (right) segmentation overlay. Red indicates skeletal muscle, green indicates trabecular bone, yellow indicates visceral fat, and blue indicates subcutaneous fat. Segmented regions also include liver (beige) and spleen (orange), which were not evaluated as part of present analysis. (A) 78-year-old woman who underwent abdominopelvic CT at an outside institution. Bone tool returned L1 vertebral body bone attenuation of -146 Hounsfield units (HU), outside of reference range. Thus, the tool was deemed technical failure for bone tool. Failure was attributed to volume averaging of vacuum phenomenon within slice. (B) 64-year-old woman who underwent abdominopelvic CT at an outside institution. Bone tool returned vertebral body bone attenuation of -10,000 HU (default value for segmentation failure detected by tool), outside of reference range. Thus, the tool was deemed technical failure for bone tool. Failure was attributed to the presence of spinal fusion hardware. Images and caption courtesy of the American Journal of Roentgenology.Axial images at L1 level, without (left) and with (right) segmentation overlay. Red indicates skeletal muscle, green indicates trabecular bone, yellow indicates visceral fat, and blue indicates subcutaneous fat. Segmented regions also include liver (beige) and spleen (orange), which were not evaluated as part of present analysis. (A) 78-year-old woman who underwent abdominopelvic CT at an outside institution. Bone tool returned L1 vertebral body bone attenuation of -146 Hounsfield units (HU), outside of reference range. Thus, the tool was deemed technical failure for bone tool. Failure was attributed to volume averaging of vacuum phenomenon within slice. (B) 64-year-old woman who underwent abdominopelvic CT at an outside institution. Bone tool returned vertebral body bone attenuation of -10,000 HU (default value for segmentation failure detected by tool), outside of reference range. Thus, the tool was deemed technical failure for bone tool. Failure was attributed to the presence of spinal fusion hardware. Images and caption courtesy of the American Journal of Roentgenology.

The group reported that the three automated AI tools for measuring body composition were technically adequate in 97.7% of the CT exams. The failure rate was 2.3%, and these failures were primarily (88%) due to an image processing error caused by incorrect DICOM header voxel dimension information (i.e., an anisometry error).

Performance of three types of AI tools for assessing body composition from CT images
Type of tool Technical adequacy rate
Bone attenuation 97.8%
Muscle amount and attenuation 99.1%
Visceral and subcutaneous fat amounts 98%

The authors acknowledged some limitations of the study, including the fact that they only evaluated technical adequacy and did not assess the ability of these AI tools to predict and influence clinical outcomes.

In the end, how can AI body composition assessment tool failures be mitigated? By hewing to AI protocols, the team advised.

"AI tool failure relating to technical factors may be largely preventable through proper acquisition and reconstruction protocols," the researchers concluded.

Page 1 of 156
Next Page