I'm currently working on a project which requires me to split a TIFF image into a file containing all tags and a file containing all image data and to reconstruct a TIFF image from these files. The only problem is that it seems that LibTIFF provides no easy way to grab all tags from an image. I've tried using TIFFGetTagListCount and then TIFFGetField to retrieve the tag, but this only returns a small subset of the tags. I've started rolling my own version, but I just want to double check and make sure I'm not overlooking something as this seems like a pretty obvious feature that should be included in the library.
Here is the closes you can get for scanning all tags:
#include "LibTIFF/tif_dir.h"
...
TIFFDirectory *td = &tif->tif_dir;
for (int fi = 0, nfi = tif->tif_nfields; nfi > 0; nfi--, fi++) {
const TIFFFieldInfo* fip = tif->tif_fieldinfo[fi];
// test if tag value is set
// (lifted directly form LibTiff _TIFFWriteDirectory)
if( fip->field_bit == FIELD_CUSTOM ) {
int ci, is_set = FALSE;
for( ci = 0; ci < td->td_customValueCount; ci++ )
is_set |= (td->td_customValues[ci].info == fip);
if( !is_set )
continue;
} else if(!TIFFFieldSet(tif, fip->field_bit))
continue;
// else: process the fip->field_tag
}
Note that you will have to take into account that some tags will appear twice (LONG and SHORT version), but only one of these will have value. The correct type to use can be looked up in the included header (the TIFFDirectory struct).
There also other catches on how to read the tags, but this at least will make you loop over all of them (the standard ones). See tif_dirinfo.c for pointers if you get stuck.
tif_dir.h
is an internal header (i.e. not exposed by the built library), including it in causes all sorts of problems like redefinitions, missing dependencies, etc. Any other way around this? –
Erich Printing all tags with tifffile:
from tifffile import TiffFile
for page in TiffFile(path_to_file).pages:
for tag in page.tags.values():
print(tag.name, tag.code, tag.dtype, tag.count, tag.value)
You can use tif_dir
field of the image. It is a struct with at least the following fields:
td_customValueCount
contains the number of 'custom' tag counts,td_customValues
is the list oftd_customValueCount
tag values (indexed from 0), and is of typeTIFFTagValue *
.
So you should be able to do something like this (in true fashion of online code, this is untested, of course!):
for (i=0; i < tiffimage->tif_dir->td_customValueCount; ++i) {
const TIFFFieldInfo *info = tiffimage->tif_dir->td_customValues[i].info;
const char *tagname = info->field_name();
/* process tag */
}
See TIFFDirectory Struct Reference. Hope that helps.
addingtags.html in the libtiff documentation contains information on handling non-standard tags (by default they're ignored). Could that be the problem?
The tiffdump utility (which ships with libtiff) accomplishes this, but looking at the code, it looks like they're mostly working around the library. They actually call lseek
and read
to read the tag info.
Correcting issue with the for loop found in cgohlke's answer above. Also extending with a more complete script so is easier to run from the command-line.
import sys, getopt
from tifffile import TiffFile
def PrintAll(path_to_file):
print('Tag.Name, Tag.Code, Tag.DType, Tag.Count, Tag.Value')
for page in TiffFile(path_to_file).pages:
for tag in page.tags.values():
print(tag.name, tag.code, tag.dtype, tag.count, tag.value)
def main(argv):
try:
opts, args = getopt.getopt(argv, 'hi:v', ["help","image="])
except getopt.GetoptError:
print('Argv error')
sys.exit(2)
for opt, arg in opts:
if opt in ('-h', '--help'):
print('printalltifftags.py -i c:\images\myimage.tif')
sys.exit(0)
elif opt in ('-i', '--image'):
filePath = arg
print('Image: ' + filePath)
PrintAll(filePath)
if __name__ == '__main__':
main(sys.argv[1:])
© 2022 - 2024 — McMap. All rights reserved.