computer-vision Questions
3
I have a PyTorch tensor of size (5, 1, 44, 44) (batch, channel, height, width), and I want to 'resize' it to (5, 1, 224, 224)
How can I do that? What functions should I use?
Spinney asked 3/11, 2019 at 1:22
5
Solved
Below is a python script that calculates the homography between two images and then map a desired point from one image to another
import cv2
import numpy as np
if __name__ == '__main__' :
# Re...
Depriest asked 2/7, 2020 at 11:53
3
Solved
I just started to study about Multiview stereo vision.
But I cannot understand disparity and depth
(disparity map and depth map either).
Could you give me the intuition?
Thanks.
Questor asked 6/4, 2020 at 5:10
2
I am training an OCR model for recognizing MRZ from passport. To train my model for more accuracy, I need to train it with maximum pictures possible. I tried to find passport's dataset on KAGGLE bu...
Mainsail asked 3/2, 2020 at 13:11
4
Solved
I calibrated my mono camera using opencv. Now I know the camera intrinsic matrix and distortion coefs [K1, K2, P1 ,P2,K3 ,K4, K5, K6] of my camera. Assuming that camera is place in [x, y, z] with [...
Stouffer asked 28/5, 2015 at 9:35
2
Solved
I was making some experimentations with the OpenCV function cv2.warpPerspective when I decided to code it from scratch to better understand its pipeline. Though I followed (hopefully) every theoret...
Oudh asked 25/9, 2022 at 9:44
4
Solved
I am just starting with computer vision. While running code I got the following error.
[INFO] loading model...
Traceback (most recent call last):
File "detect_faces_video.py", line 24, in
net =...
Pembroke asked 30/9, 2018 at 7:47
3
Solved
I'm using batch normalization with batch size 10 for face detection.
Does batch normalization works with such small batch sizes? If not, then what else can i use for normalization?
Heyday asked 2/7, 2019 at 20:38
4
Solved
Hi have seen a lot of tutorials how to do simple image stitching using two photos and that is no problem.
But what to do when I want to make a panorama from 4-6 images or more?
I have code that tak...
Ectosarc asked 3/7, 2014 at 20:55
3
Solved
I need to find the regional maxima of an image to obtain foreground markers for watershed segmentation. I see in matlab use the function imregionalmax(). As I don't have the matlab software, I use ...
Hellion asked 22/12, 2014 at 7:13
2
Solved
OpenCV => 3.2
Operating System / Platform => Windows 64 Bit
Compiler => Visual Studio 2015
I am currently working on my project which involves vehicle detection and tracking and estimating and o...
Johnsonian asked 22/5, 2017 at 4:23
2
Solved
I want to highlight specific words/sentences in a website screenshot.
Once the screenshot is taken, I extract the text using pytesseract and cv2. That works well and I can get text and data about...
Bullard asked 9/1, 2019 at 17:51
6
Solved
i am trying to read the CIFAR10 datasets, given in batches from https://www.cs.toronto.edu/~kriz/cifar.html>. i am trying to put it in a data frame using pickle and read 'data' part of it. But i am...
Transience asked 29/5, 2016 at 16:29
6
I'm looking for the fastest and more efficient method of detecting an object in a moving video. Things to note about this video: It is very grainy and low resolution, also both the background and f...
Fordo asked 20/11, 2009 at 15:52
2
I'm looking for ways to convert a mask (a Height x Width boolean image) into a series of bounding boxes (see example picture below, which I hand-drew), with boxes encircling the "islands of tr...
Farrica asked 8/8, 2022 at 17:50
4
There are two coordinate systems. We know the 3D coordinates of the origin and the 3D vectors of the axes of the second coordinate system with respect to the first coordinates system. Then how can ...
Catlett asked 21/12, 2015 at 8:57
10
Solved
I am using ffmpeg and getting this error
ffmpeg: error while loading shared libraries: libopenh264.so.5: cannot open shared object file: No such file or directory
I have already checked if the...
Overstrain asked 5/6, 2020 at 10:59
4
Solved
I am wondering, if in Convolutional Neural Networks batch normalization should be applied with respect to every pixel separately, or should I take the mean of pixels with respect to each channel?
...
Incudes asked 21/8, 2017 at 14:40
2
I'm wondering what would be the easiest way to generate a 1D gaussian kernel in python given the filter length. I think that the idea is to evaluate the normal distribution for the values of the ve...
Trinitarianism asked 16/2, 2013 at 23:0
9
I tried to develop an FCN-16 model in Keras. I initialized the weights with similar FCN-16 model weights.
def FCN8 (nClasses, input_height=256, input_width=256):
## input_height and width must ...
Naldo asked 15/12, 2019 at 1:9
5
Solved
Tried to load training data with pytorch torch.datasets.ImageFolder in Colab.
transform = transforms.Compose([transforms.Resize(400),
transforms.ToTensor()])
dataset_path = 'ss/'
dataset = dataset...
Tivoli asked 2/7, 2021 at 17:31
15
Solved
I have searched many places but ALL I get is HOW to install it, not how to verify that it is installed. I can verify my NVIDIA driver is installed, and that CUDA is installed, but I don't know how ...
Imf asked 9/7, 2015 at 18:58
6
While training the model, I encountered the following problem:
RuntimeError: CUDA out of memory. Tried to allocate 304.00 MiB (GPU 0; 8.00 GiB total capacity; 142.76 MiB already allocated; 6.32 GiB...
Jacksnipe asked 16/3, 2022 at 13:53
3
I am currently studying a module in computer vision called edge detection.
I am trying to understand the meaning of gradient orientation and gradient magnitude.
Melly asked 6/11, 2013 at 15:13
2
Solved
I know that one can compute the homography matrix by using at least four correpondence points.
I was wondering if and how can I obtain a homography matrix if I already know the rotation and transl...
Repine asked 28/5, 2017 at 20:12
1 Next >
© 2022 - 2024 — McMap. All rights reserved.