computer-vision - 2

3

Solved

I need to implement a multi-label image classification model in PyTorch. However my data is not balanced, so I used the WeightedRandomSampler in PyTorch to create a custom dataloader. But when I it...

machine-learning deep-learning computer-vision pytorch

Poky asked 23/3, 2020 at 10:45

1

LSTM object detection tensorflow

Long story short: How to prepare data for lstm object detection retraining of the tensorflow master github implementation. Long story: Hi all, I recently found implementation a lstm object detec...

python tensorflow computer-vision lstm object-detection

Vaish asked 8/1, 2019 at 14:29

3

Measuring the width of each part of a mask in an image which has a circular shape

I have one mask: According to this method After coding, I get the result as below: Since this is a new circular shape rather than a straight line, measuring the width using the rotation method ab...

python opencv image-processing computer-vision

Police asked 1/1 at 10:36

1

Solved

No module named 'PySpin'

I am trying to import PySpin in Visual Studio to be able to work with Flir's camera but every time I encounter this: "No module named 'PySpin'" And when I try the Pip install PySpin, no...

image-processing computer-vision flir pyspin

Revolutionist asked 22/12, 2023 at 15:48

3

Solved

face_recognition problem with face_encodings function

I am a newbie and having difficulty on resolving this issue. What I am trying to do is run the sample code from face_recognition using a webcam. Both of the two example doesn't work on me and keeps...

opencv computer-vision face-recognition dlib

Eveliaevelin asked 4/4, 2023 at 7:36

1

How to combine the results of multiple OCR tools to get better text recognition [closed]

Imagine, you have different OCR tools to read text from images but none of them gives you a 100% accurate output. Combined however, the result could come very close to the ground truth - What...

nlp computer-vision ocr sensor-fusion

Trapper asked 26/3, 2019 at 23:28

3

Solved

OpenCV feature matching multiple objects

How can I find multiple objects of one type on one image. I use ORB feature finder and brute force matcher (opencv = 3.2.0). My source code: import numpy as np import cv2 from matplotlib import p...

opencv image-processing computer-vision orb

Inelegant asked 21/3, 2017 at 21:8

6

Solved

Implementing a Harris corner detector

I am implementing a Harris corner detector for educational purposes but I'm stuck at the harris response part. Basically, what I am doing, is: Compute image intensity gradients in x- and y-direct...

algorithm matlab computer-vision feature-detection corner-detection

Herzegovina asked 5/10, 2010 at 9:3

2

Solved

Implementing log Gabor filter bank

I was reading this paper "Self-Invertible 2D Log-Gabor Wavelets" it defines 2D log gabor filter as such: The paper also states that the filter only covers one side of the frequency space and sh...

python image-processing computer-vision

Topdrawer asked 2/8, 2015 at 16:35

6

Solved

Edge Detection method better than Canny Edge detection

Is there an Edge Detection Method that performs significantly better than the Canny Edge Detector ??

image-processing computer-vision edge-detection

Bodkin asked 27/2, 2014 at 9:59

3

How can I access my laptop's built-in infrared webcam using Python?

I'm trying to access my laptop's built-in infrared webcam (intended for windows hello) in a Python project. I can access the normal RGB camera quite easily using the VideoCapture class from OpenCV,...

python opencv computer-vision

Launcher asked 3/4, 2021 at 15:54

6

Efficiently implementing erode/dilate

So normally and very inefficiently min/max filter is implemented by using four for loops. for( index1 < dy ) { // y loop for( index2 < dx ) { // x loop for( index3 < StructuringElement....

algorithm math computer-vision filtering convolution

Billposter asked 18/2, 2014 at 12:58

2

Solved

Why does the focal length in the camera intrinsics matrix have two dimensions?

In the pinhole camera model there is only one focal length which is between the principal point and the camera center. However, after calculating the camera's intrinsic parameters, the matrix cont...

computer-vision camera-calibration perspectivecamera

Milurd asked 2/5, 2013 at 3:27

5

COCO json annotation to YOLO txt format

how to convert a single COCO JSON annotation file into a YOLO darknet format?? like below each individual image has separate filename.txt file

tensorflow computer-vision object-detection yolo coco

Douzepers asked 15/7, 2021 at 18:18

12

Add padding to images to get them into the same shape

l have a set of images of different sizes (45,50,3), (69,34,3), (34,98,3). l want to add padding to these images as follows: Take the max width and length of the whole images then put the image in...

python image opencv image-processing computer-vision

Extracanonical asked 13/4, 2017 at 11:34

4

How to convert YOLO annotations (.txt) to PASCAL VOC (.xml)?

I have built a dataset to train YOLOv4 and I have all the labels in YOLO format (I used LabelImg). Now I want to train SSD with the same dataset and therefore I need the labels in the PASCAL VOC fo...

computer-vision label object-detection yolo

Gasp asked 4/7, 2021 at 14:36

2

Solved

Up to a scale factor

I am reading up on homographies and i have seen some places that it says that the homography is defined "up to a scale factor" what does this mean? Is there an upper limit for scaling the homograph...

math computer-vision camera-calibration

Ennoble asked 14/6, 2013 at 18:18

3

Effect of variance (sigma) at Gaussian smoothing

I know about Gaussian, variance, and image blurring, and I think that I understood the concept of variance at Gaussian blur, but still I am not 100% sure. I just want to know the role of sigma or v...

opencv image-processing computer-vision

Rickierickman asked 11/4, 2014 at 8:12

2

Solved

Correct Implementation of Dice Loss in Tensorflow / Keras

I've been trying to experiment with Region Based: Dice Loss but there have been a lot of variations on the internet to a varying degree that I could not find two identical implementations. The prob...

tensorflow machine-learning keras deep-learning computer-vision

Cedar asked 11/5, 2022 at 3:43

2

Solved

How to use pt file

I'm trying to make a currency recognition model and I did so using a dataset on kaggle and colab using yolov5 and I exactly carried out the steps explained on yolov5 github. At the end, I downloade...

python pytorch computer-vision yolov5

Lyns asked 1/4, 2022 at 12:35

1

How to detect a flash / Glare in an image of document using skimage / opencv in python?

Please suggest a new approach or at least a method to make any of these robust enough to detect at good rate I have some images (mostly taken from computer screen) where some kind of flash from cam...

python image opencv image-processing computer-vision

Crock asked 1/8, 2021 at 12:53

2

Need help understanding cross_val_score in sklearn python

I am currently trying to implement K-FOLD cross validation in classification using sklearn in python. I understand the basic concept behind K-FOLD and cross validation. However, I dont understand w...

python validation scikit-learn computer-vision

Coolish asked 2/10, 2018 at 15:23

2

Solved

Is it possible to load huggingface model which does not have config.json file?

I am trying to load this semantic segmentation model from HF using the following code: from transformers import pipeline model = pipeline("image-segmentation", model="Carve/u2net-un...

python machine-learning computer-vision huggingface-transformers

Ethelethelbert asked 3/3, 2023 at 12:16

10

Solved

Choosing the correct upper and lower HSV boundaries for color detection with`cv::inRange` (OpenCV)

I have an image of a coffee can with an orange lid position of which I want to find. Here is it . gcolor2 utility shows HSV at the center of the lid to be (22, 59, 100). The question is how to cho...

python opencv computer-vision hsv color-detection

Lamprophyre asked 8/6, 2012 at 12:9

5

Why does one not use IOU for training?

When people try to solve the task of semantic segmentation with CNN's they usually use a softmax-crossentropy loss during training (see Fully conv. - Long). But when it comes to comparing the perfo...

machine-learning computer-vision deep-learning image-segmentation

Meatman asked 7/11, 2016 at 21:48

computer-vision Questions

Recommended topics

Hot tags