computer-vision Questions
3
Solved
I need to implement a multi-label image classification model in PyTorch. However my data is not balanced, so I used the WeightedRandomSampler in PyTorch to create a custom dataloader. But when I it...
Poky asked 23/3, 2020 at 10:45
1
Long story short:
How to prepare data for lstm object detection retraining of the tensorflow master github implementation.
Long story:
Hi all,
I recently found implementation a lstm object detec...
Vaish asked 8/1, 2019 at 14:29
3
I have one mask:
According to this method
After coding, I get the result as below:
Since this is a new circular shape rather than a straight line, measuring the width using the rotation method ab...
Police asked 1/1 at 10:36
1
Solved
I am trying to import PySpin in Visual Studio to be able to work with Flir's camera but every time I encounter this:
"No module named 'PySpin'"
And when I try the Pip install PySpin, no...
Revolutionist asked 22/12, 2023 at 15:48
3
Solved
I am a newbie and having difficulty on resolving this issue.
What I am trying to do is run the sample code from face_recognition using a webcam. Both of the two example doesn't work on me and keeps...
Eveliaevelin asked 4/4, 2023 at 7:36
1
Imagine, you have different OCR tools to read text from images but none of them gives you a 100% accurate output. Combined however, the result could come very close to the ground truth - What...
Trapper asked 26/3, 2019 at 23:28
3
Solved
How can I find multiple objects of one type on one image.
I use ORB feature finder and brute force matcher (opencv = 3.2.0).
My source code:
import numpy as np
import cv2
from matplotlib import p...
Inelegant asked 21/3, 2017 at 21:8
6
Solved
I am implementing a Harris corner detector for educational purposes but I'm stuck at the harris response part. Basically, what I am doing, is:
Compute image intensity gradients in x- and y-direct...
Herzegovina asked 5/10, 2010 at 9:3
2
Solved
I was reading this paper "Self-Invertible 2D Log-Gabor Wavelets" it defines 2D log gabor filter as such:
The paper also states that the filter only covers one side of the frequency space and sh...
Topdrawer asked 2/8, 2015 at 16:35
6
Solved
Is there an Edge Detection Method that performs significantly better than the Canny Edge Detector ??
Bodkin asked 27/2, 2014 at 9:59
3
I'm trying to access my laptop's built-in infrared webcam (intended for windows hello) in a Python project. I can access the normal RGB camera quite easily using the VideoCapture class from OpenCV,...
Launcher asked 3/4, 2021 at 15:54
6
So normally and very inefficiently min/max filter is implemented by using four for loops.
for( index1 < dy ) { // y loop
for( index2 < dx ) { // x loop
for( index3 < StructuringElement....
Billposter asked 18/2, 2014 at 12:58
2
Solved
In the pinhole camera model there is only one focal length which is between the principal point and the camera center.
However, after calculating the camera's intrinsic parameters, the matrix cont...
Milurd asked 2/5, 2013 at 3:27
5
how to convert a single COCO JSON annotation file into a YOLO darknet format?? like below
each individual image has separate filename.txt file
Douzepers asked 15/7, 2021 at 18:18
12
l have a set of images of different sizes (45,50,3), (69,34,3), (34,98,3). l want to add padding to these images as follows:
Take the max width and length of the whole images then put the image in...
Extracanonical asked 13/4, 2017 at 11:34
4
I have built a dataset to train YOLOv4 and I have all the labels in YOLO format (I used LabelImg). Now I want to train SSD with the same dataset and therefore I need the labels in the PASCAL VOC fo...
Gasp asked 4/7, 2021 at 14:36
2
Solved
I am reading up on homographies and i have seen some places that it says that the homography is defined "up to a scale factor" what does this mean? Is there an upper limit for scaling the homograph...
Ennoble asked 14/6, 2013 at 18:18
3
I know about Gaussian, variance, and image blurring, and I think that I understood the concept of variance at Gaussian blur, but still I am not 100% sure.
I just want to know the role of sigma or v...
Rickierickman asked 11/4, 2014 at 8:12
2
Solved
I've been trying to experiment with Region Based: Dice Loss but there have been a lot of variations on the internet to a varying degree that I could not find two identical implementations. The prob...
Cedar asked 11/5, 2022 at 3:43
2
Solved
I'm trying to make a currency recognition model and I did so using a dataset on kaggle and colab using yolov5 and I exactly carried out the steps explained on yolov5 github. At the end, I downloade...
Lyns asked 1/4, 2022 at 12:35
1
Please suggest a new approach or at least a method to make any of these robust enough to detect at good rate
I have some images (mostly taken from computer screen) where some kind of flash from cam...
Crock asked 1/8, 2021 at 12:53
2
I am currently trying to implement K-FOLD cross validation in classification using sklearn in python. I understand the basic concept behind K-FOLD and cross validation. However, I dont understand w...
Coolish asked 2/10, 2018 at 15:23
2
Solved
I am trying to load this semantic segmentation model from HF using the following code:
from transformers import pipeline
model = pipeline("image-segmentation", model="Carve/u2net-un...
Ethelethelbert asked 3/3, 2023 at 12:16
10
Solved
I have an image of a coffee can with an orange lid position of which I want to find.
Here is it .
gcolor2 utility shows HSV at the center of the lid to be (22, 59, 100).
The question is how to cho...
Lamprophyre asked 8/6, 2012 at 12:9
5
When people try to solve the task of semantic segmentation with CNN's they usually use a softmax-crossentropy loss during training (see Fully conv. - Long). But when it comes to comparing the perfo...
Meatman asked 7/11, 2016 at 21:48
© 2022 - 2024 — McMap. All rights reserved.