Path: blob/master/site/en-snapshot/io/tutorials/dicom.ipynb
38968 views
Copyright 2019 The TensorFlow IO Authors.
Decode DICOM files for medical imaging
Overview
This tutorial shows how to use tfio.image.decode_dicom_image in TensorFlow IO to decode DICOM files with TensorFlow.
Setup and Usage
Download DICOM image
The DICOM image used in this tutorial is from the NIH Chest X-ray dataset.
The NIH Chest X-ray dataset consists of 100,000 de-identified images of chest x-rays in PNG format, provided by NIH Clinical Center and could be downloaded through this link.
Google Cloud also provides a DICOM version of the images, available in Cloud Storage.
In this tutorial, you will download a sample file of the dataset from the GitHub repo
Note: For more information about the dataset, please find the following reference:
Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Mohammadhadi Bagheri, Ronald Summers, ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases, IEEE CVPR, pp. 3462-3471, 2017
Install required Packages, and restart runtime
Decode DICOM image
Decode DICOM Metadata and working with Tags
decode_dicom_data decodes tag information. dicom_tags contains useful information as the patient's age and sex, so you can use DICOM tags such as dicom_tags.PatientsAge and dicom_tags.PatientsSex. tensorflow_io borrow the same tag notation from the pydicom dicom package.
Documentation
This package has two operations which wrap DCMTK functions. decode_dicom_image decodes the pixel data from DICOM files, and decode_dicom_data decodes tag information. tags contains useful DICOM tags such as tags.PatientsName. The tag notation is borrowed from the pydicom dicom package.
Getting DICOM Image Data
contents: A Tensor of type string. 0-D. The byte string encoded DICOM filecolor_dim: An optionalbool. Defaults toFalse. IfTrue, a third channel will be appended to all images forming a 3-D tensor. A 1024 x 1024 grayscale image will be 1024 x 1024 x 1on_error: Defaults toskip. This attribute establishes the behavior in case an error occurs on opening the image or if the output type cannot accomodate all the possible input values. For example if the user sets the output dtype totf.uint8, but a dicom image stores atf.uint16type.strictthrows an error.skipreturns a 1-D empty tensor.lossycontinues with the operation scaling the value via thescaleattribute.scale: Defaults topreserve. This attribute establishes what to do with the scale of the input values.autowill autoscale the input values, if the output type is integer,autowill use the maximum output scale for example auint8which stores values from [0, 255] can be linearly stretched to fill auint16that is [0,65535]. If the output is float,autowill scale to [0,1].preservekeeps the values as they are, an input value greater than the maximum possible output will be clipped.dtype: An optionaltf.DTypefrom:tf.uint8, tf.uint16, tf.uint32, tf.uint64, tf.float16, tf.float32, tf.float64. Defaults totf.uint16.name: A name for the operation (optional).
Returns A Tensor of type dtype and the shape is determined by the DICOM file.
Getting DICOM Tag Data
contents: A Tensor of type string. 0-D. The byte string encoded DICOM filetags: A Tensor of typetf.uint32of any dimension. Theseuint32numbers map directly to DICOM tagsname: A name for the operation (optional).
Returns A Tensor of type tf.string and same shape as tags. If a dicom tag is a list of strings, they are combined into one string and seperated by a double backslash \\. There is a bug in DCMTK if the tag is a list of numbers, only the zeroth element will be returned as a string.
Bibtex
If this package helped, please kindly cite the below:
License
Copyright 2019 Marcelo Lerendegui, Ouwen Huang, Gradient Health Inc.
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
View on TensorFlow.org
Run in Google Colab
View source on GitHub
Download notebook