I have to write a test case in Python to check whether a jpg image is in color or grayscale. Can anyone please let me know if there is any way to do it with out installing extra libraries like OpenCV?
You can check every pixel to see if it is grayscale (R == G == B)
from PIL import Image
def is_grey_scale(img_path):
img = Image.open(img_path).convert('RGB')
w, h = img.size
for i in range(w):
for j in range(h):
r, g, b = img.getpixel((i,j))
if r != g != b:
return False
return True
im
and test them? Or use modulo arithmetic to traverse the image. If sampling(-without-replacement) say 100 random i,j-points isn't conclusive, then just scan it linearly. Or maybe vary the row order with modulo arithmetic. You could wrap all this in a custom iterator iter_pixels(im)
. –
Galvani for i in range(0, w, 2)
and for j in range(0, h, 2)
–
Variolite Can be done as follow:
from scipy.misc import imread, imsave, imresize
image = imread(f_name)
if(len(image.shape)<3):
print 'gray'
elif len(image.shape)==3:
print 'Color(RGB)'
else:
print 'others'
You can check every pixel to see if it is grayscale (R == G == B)
from PIL import Image
def is_grey_scale(img_path):
img = Image.open(img_path).convert('RGB')
w, h = img.size
for i in range(w):
for j in range(h):
r, g, b = img.getpixel((i,j))
if r != g != b:
return False
return True
im
and test them? Or use modulo arithmetic to traverse the image. If sampling(-without-replacement) say 100 random i,j-points isn't conclusive, then just scan it linearly. Or maybe vary the row order with modulo arithmetic. You could wrap all this in a custom iterator iter_pixels(im)
. –
Galvani for i in range(0, w, 2)
and for j in range(0, h, 2)
–
Variolite There is more pythonic way using numpy functionality and opencv:
import cv2
def isgray(imgpath):
img = cv2.imread(imgpath)
if len(img.shape) < 3: return True
if img.shape[2] == 1: return True
b,g,r = img[:,:,0], img[:,:,1], img[:,:,2]
if (b==g).all() and (b==r).all(): return True
return False
For faster processing, it is better to avoid loops on every pixel, using ImageChops, (but also to be sure that the image is truly grayscale, we need to compare colors on every pixel and cannot just use the sum):
from PIL import Image,ImageChops
def is_greyscale(im):
"""
Check if image is monochrome (1 channel or 3 identical channels)
"""
if im.mode not in ("L", "RGB"):
raise ValueError("Unsuported image mode")
if im.mode == "RGB":
rgb = im.split()
if ImageChops.difference(rgb[0],rgb[1]).getextrema()[1]!=0:
return False
if ImageChops.difference(rgb[0],rgb[2]).getextrema()[1]!=0:
return False
return True
In case of a grayscale image, all channels in a certain pixel are equal (if you only have one channel, then you don't have a problem). So basically, you can list all the pixels with their three channel values to check if each pixel has all three channels equal.
Image.getcolors()
returns an unsorted list of (count, pixel) values.
im = Image.open('path_to_image.whatever')
color_count = im.getcolors()
If len(color_count)
exceeds 256 (default max value), this function returns None, meaning you had more than 256 color options in your pixel list, hence it is a colored image (grayscale can only have 256 colors, (0,0,0)
to (255,255,255)
).
So after that you only need :
if color_count:
# your image is grayscale
else:
# your images is colored
Note this will work only when using the default parameter value of getcolors()
.
Documentation: https://pillow.readthedocs.io/en/stable/reference/Image.html#PIL.Image.Image.getcolors
A performance-enhance for fast results: since many images have black or white border, you'd expect faster termination by sampling a few random i,j-points from im and test them? Or use modulo arithmetic to traverse the image rows. First we sample(-without-replacement) say 100 random i,j-points; in the unlikely event that isn't conclusive, then we scan it linearly.
Using a custom iterator iterpixels(im). I don't have PIL installed so I can't test this, here's the outline:
import Image
def isColor(r,g,b): # use tuple-unpacking to unpack pixel -> r,g,b
return (r != g != b)
class Image_(Image):
def __init__(pathname):
self.im = Image.open(pathname)
self.w, self.h = self.im.size
def iterpixels(nrand=100, randseed=None):
if randseed:
random.seed(randseed) # For deterministic behavior in test
# First, generate a few random pixels from entire image
for randpix in random.choice(im, n_rand)
yield randpix
# Now traverse entire image (yes we will unwantedly revisit the nrand points once)
#for pixel in im.getpixel(...): # you could traverse rows linearly, or modulo (say) (im.height * 2./3) -1
# yield pixel
def is_grey_scale(img_path="lena.jpg"):
im = Image_.(img_path)
return (any(isColor(*pixel)) for pixel in im.iterpixels())
(Also my original remark stands, first you check the JPEG header, offset 6: number of components (1 = grayscale, 3 = RGB). If it's 1=grayscale, you know the answer already without needing to inspect individual pixels.)
I faced a similar situation, where I tried the following approaches:
- Reading using
IMREAD_UNCHANGED
and checking for image.shape - Splitting B,G,R channels and checking if they are equal
Both of these approaches got me only like 53% accuracy in my dataset. I had to relax the condition for checking pixels in different channels and create a ratio to classify it as grey or color. With this approach, I was able to get 87.3% accuracy on my dataset.
Here is the logic which worked for me:
import cv2
import numpy as np
###test image
img=cv2.imread('test.jpg')
### splitting b,g,r channels
b,g,r=cv2.split(img)
### getting differences between (b,g), (r,g), (b,r) channel pixels
r_g=np.count_nonzero(abs(r-g))
r_b=np.count_nonzero(abs(r-b))
g_b=np.count_nonzero(abs(g-b))
### sum of differences
diff_sum=float(r_g+r_b+g_b)
### finding ratio of diff_sum with respect to size of image
ratio=diff_sum/img.size
if ratio>0.005:
print("image is color")
else:
print("image is greyscale")
not enough values to unpack (expected 3, got 0)
–
Snakebird img=cv2.imread('path/to/image.jpg')
do a print(img.shape)
and what is the output. –
Unorthodox Old question but I needed a different solution. Sometimes 3 channel images (eg RGB) might be almost grayscale without every pixel being identical in 3 channels. This checks every pixel but you can also subsample the image if needed. I used slope here but you can use checks on most of these parmaters from the regression. Linear regressions are usually very fast due to internal matrix multiply solution.
import glob
import scipy
import cv2
THRESH = 0.01
BASEDIR = 'folder/*.jpg'
files = glob.glob(BASEDIR)
for file in files:
img = cv2.imread(file)
slope1, intercept1, r1, p1, se1 = scipy.stats.linregress(img[:,:,0].flatten(),img[:,:,1].flatten())
slope2, intercept2, r2, p2, se2 = scipy.stats.linregress(img[:,:,0].flatten(),img[:,:,2].flatten())
if abs(slope1 - 1) > THRESH or abs(slope2 - 1) > THRESH:
print(f'{file} is colour')
else:
print(f'{file} is close to grey scale')
(0,255,255)
, (255,0,255)
and (255,255,0)
; if I understand your code, it would be detected as perfect grayscale, but it's not. Anyway a vote up for considering the almost grayscale topic. –
Seve Why wouldn't we use ImageStat module?
from PIL import Image, ImageStat
def is_grayscale(path="image.jpg")
im = Image.open(path).convert("RGB")
stat = ImageStat.Stat(im)
if sum(stat.sum)/3 == stat.sum[0]:
return True
else:
return False
stat.sum gives us a sum of all pixels in list view = [R, G, B] for example [568283302.0, 565746890.0, 559724236.0]. For grayscale image all elements of list are equal.
The best way of checking if a JPEG is greyscale with Pillow, is to open and see if it is in L
mode which Pillow uses for single-channel, greyscale images. If it is L
mode there is nothing else needs doing and you have your answer.
Then you should be aware it could be in CMYK
mode, so you should convert it to RGB
mode for the following step.
Next, you should convert to HSV mode and get the maximum saturation of any pixel. If all pixels are black/white or grey, that will be zero. As the colours in the image become more saturated towards pure reds, greens, blues, cyans, magentas and yellows, so the maximum saturation will tend towards 255.
#!/usr/bin/env python3
from PIL import Image,ImageChops
import sys
def isGreyscale(im):
"""Check if image is greyscale, i.e. unsaturated."""
# If mode is 'L', image is necessarily greyscale, nothing else needs doing
if im.mode == 'L':
return True
# Check if image is not RGB, it may be CMYK and could be Palette if PNG image
if im.mode != 'RGB':
im = im.convert('RGB')
# Convert to HSV and extract saturation
# https://en.wikipedia.org/wiki/HSL_and_HSV
HSV = im.convert('HSV')
H, S, V = HSV.split()
# Get minimum and maximum saturation
minSat, maxSat = S.getextrema()
print(f'{maxSat=}')
# maxSat will approach zero for black, white, and grey images and reach 255 for staturated, colourful images
if maxSat < 1:
return True
return False
if __name__ == "__main__":
im = Image.open(sys.argv[1]);
result = isGreyscale(im)
print(result)
Note that if you are processing images in Python with for
loops, you have almost certainly gone wrong - that is inefficient and error-prone. Try to use Pillow's built-in functions which are coded in C
, or Numpy or OpenCV which is SIMD-optimised.
Here is a version of Alexey Antonenko answer using PIL.image instead of cv2. In case you have float images I think it is safer to use the np.allclose
function.
from PIL import Image
import numpy as np
def isgray(imgpath):
img_pil = Image.open(imgpath)
img = np.asarray(img_pil)
if len(img.shape) < 3: return True
if img.shape[2] == 1: return True
r,g,b = img[:,:,0], img[:,:,1], img[:,:,2]
if np.allclose(r,g) and np.allclose(r,b): return True
return False
Can be done as follow:
import numpy as np
import cv2
image = cv2.imread('f_name',cv2.IMREAD_UNCHANGED)
if(len(image.shape)<3):
if(len(np.unique(image)) == 2):
print('Binary Image')
else:
print('gray')
elif(len(image.shape))==3:
print('Color(RGB)')
else:
print('others')
Just adding my 2 cents from the performance perspective. I tested 4 implementations on my set of 368 images, each of size 1920x1080, 227 were colour images, 141 were in grayscale. Here are implementations and corresponding time results in seconds (note that time comparison exclude jpeg decoding time):
def is_gray_scale1(img: Image) -> bool:
img = img.convert('RGB')
w, h = img.size
for i in range(w):
for j in range(h):
r, g, b = img.getpixel((i, j))
if r != g or g != b:
return False
return True
Elapsed time: 104.14 s
def is_gray_scale2(img: Image) -> bool:
img = img.convert('RGB')
np_img = np.array(img)
r, g, b = np_img[:, :, 0], np_img[:, :, 1], np_img[:, :, 2]
return np.array_equal(r, g) and np.array_equal(g, b)
Elapsed time: 2.94 s
def is_gray_scale3(img: Image) -> bool:
if img.mode not in ("L", "RGB"):
raise ValueError("Unsuported image mode")
if img.mode == "RGB":
rgb = img.split()
if ImageChops.difference(rgb[0], rgb[1]).getextrema()[1] != 0:
return False
if ImageChops.difference(rgb[0], rgb[2]).getextrema()[1] != 0:
return False
return True
Elapsed time: 1.47 s
def is_gray_scale4(img: Image) -> bool:
color_count = img.getcolors()
if color_count:
return True
else:
return False
Elapsed time: 0.34 s (note this is buggy implementation, think of entire image being e.g. red)
L
mode (i.e. greyscale) into RGB is pointless and inefficient. Counting the colours and assuming less than 256 makes it greyscale is clearly wrong - what about an image that is half red and half blue - it will have two colours. –
Marrilee As you are probably correct, OpenCV may be an overkill for this task but it should be okay to use Python Image Library (PIL) for this. The following should work for you:
import Image
im = Image.open("lena.jpg")
EDIT As pointed out by Mark and JRicardo000, you may iterate over each pixel. You could also make use of the im.split() function here.
© 2022 - 2024 — McMap. All rights reserved.