computer-vision Questions

2

I am trying to determine the best method to extract handwritten data from a scanned document. The handwritten data is in specific boxed areas. I generated the digital version of the document, and...
Surfboard asked 12/2, 2013 at 20:25

2

I know how to do this with OpenCV and PIL. I can't use OpenCV in this project, and if I use PIL I have to convert between PIL Image and numpy array. I don't want to do that. I'm already usi...
Arabian asked 11/8, 2020 at 22:12

10

Solved

I am using a Sobel filter of size 3x3 to calculate the image derivative. Looking at some articles on the internet, it seems that Sobel kernels of size 5x5 and 7x7 are also common, but I...

2

Solved

I have a series of concentric rectangles and wish to obtain the mean of the outer rectangle excluding the inner rectangle. See the attached diagram; I need to get the mean for the shaded area. ...
Gingham asked 26/12, 2016 at 14:48
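
One way to read the concentric-rectangle question above: the mean of the shaded ring is the outer sum minus the inner sum, divided by the difference in pixel counts. The sketch below is a minimal pure-Python illustration, not the accepted answer; the function name `ring_mean` and the `(top, left, bottom, right)` rectangle convention are assumptions made for this example, and the inner rectangle is assumed to lie fully inside the outer one.

```python
def ring_mean(image, outer, inner):
    """Mean of pixels inside `outer` but outside `inner`.

    `image` is a list of rows. Rectangles are (top, left, bottom, right)
    with bottom/right exclusive (a convention assumed for this sketch).
    """
    def rect_sum_and_count(rect):
        top, left, bottom, right = rect
        total = sum(sum(row[left:right]) for row in image[top:bottom])
        return total, (bottom - top) * (right - left)

    outer_sum, outer_n = rect_sum_and_count(outer)
    inner_sum, inner_n = rect_sum_and_count(inner)
    ring_n = outer_n - inner_n
    if ring_n <= 0:
        raise ValueError("inner rectangle must be strictly smaller than outer")
    return (outer_sum - inner_sum) / ring_n


# 4x4 image: border pixels are 1, the central 2x2 block is 9.
image = [
    [1, 1, 1, 1],
    [1, 9, 9, 1],
    [1, 9, 9, 1],
    [1, 1, 1, 1],
]
print(ring_mean(image, outer=(0, 0, 4, 4), inner=(1, 1, 3, 3)))  # 1.0
```

The same subtraction works on array slices or integral images if a numerical library is available.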

2

I have an input tensor of size [1,32,296,400] and a pixel set of size [1, 56000, 400, 2]. After applying grid_sample with mode='bilinear', I get [1, 32, 56000, 400]. Can I know what exactly happe...
Avouch asked 6/1, 2020 at 22:58
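
For the grid_sample question above: the output takes its spatial size from the grid, which is why a [1, 56000, 400, 2] grid yields a [1, 32, 56000, 400] result. Each grid entry holds a normalized (x, y) location in [-1, 1], and every channel is sampled at that location. The sketch below mimics the bilinear lookup for one channel and one grid entry in plain Python. It assumes the `align_corners=False` coordinate mapping and zero padding outside the image; `bilinear_sample` is a hypothetical helper written for this illustration, not a PyTorch function.

```python
import math


def bilinear_sample(channel, gx, gy):
    """Sample one 2D channel (list of rows) at normalized coords (gx, gy).

    Assumes the align_corners=False mapping: -1 and +1 refer to the outer
    edges of the corner pixels. Out-of-range neighbours contribute 0.
    """
    height, width = len(channel), len(channel[0])
    # Normalized [-1, 1] -> continuous pixel coordinates.
    x = ((gx + 1) * width - 1) / 2
    y = ((gy + 1) * height - 1) / 2
    x0, y0 = math.floor(x), math.floor(y)
    x1, y1 = x0 + 1, y0 + 1
    wx, wy = x - x0, y - y0  # fractional offsets used as blend weights

    def pixel(r, c):
        if 0 <= r < height and 0 <= c < width:
            return channel[r][c]
        return 0.0  # zero padding (assumed)

    top = (1 - wx) * pixel(y0, x0) + wx * pixel(y0, x1)
    bottom = (1 - wx) * pixel(y1, x0) + wx * pixel(y1, x1)
    return (1 - wy) * top + wy * bottom


channel = [[0.0, 1.0],
           [2.0, 3.0]]
print(bilinear_sample(channel, 0.0, 0.0))    # centre of the image: 1.5
print(bilinear_sample(channel, -0.5, -0.5))  # centre of top-left pixel: 0.0
```

Repeating this lookup for each of the 56000 x 400 grid entries and each of the 32 channels gives the output shape reported in the question.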

4

Solved

GIMP has a convenient function that allows you to convert an arbitrary color to an alpha channel. Essentially all pixels become transparent relative to how far away from the chosen color they are....
Durmast asked 8/4, 2019 at 21:37

4

Solved

I have two images and found three similar 2D points using SIFT. I need to compute the affine transformation between the images. Unfortunately, I missed the lecture and the information out there is a ...
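
As background for the affine question above: a 2D affine map has six unknowns (x' = a*x + b*y + c, y' = d*x + e*y + f), so three non-collinear point pairs determine it exactly. The sketch below solves the two resulting 3x3 systems with Cramer's rule in plain Python. The function names are hypothetical and the points in the usage example are made up for illustration; with more than three matches, a least-squares fit would be the usual replacement.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as a list of rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def solve3(m, rhs):
    """Solve m @ x = rhs for a 3x3 system using Cramer's rule."""
    d = det3(m)
    if abs(d) < 1e-12:
        raise ValueError("source points are collinear; affine map is not unique")
    solution = []
    for col in range(3):
        replaced = [list(row) for row in m]
        for r in range(3):
            replaced[r][col] = rhs[r]
        solution.append(det3(replaced) / d)
    return solution


def affine_from_points(src, dst):
    """Return the 2x3 matrix [[a, b, c], [d, e, f]] mapping src to dst."""
    m = [[x, y, 1.0] for x, y in src]
    row_x = solve3(m, [x for x, _ in dst])  # a, b, c
    row_y = solve3(m, [y for _, y in dst])  # d, e, f
    return [row_x, row_y]


# Illustrative points: scale x by 2, y by 3, then translate by (2, 3).
src = [(0, 0), (1, 0), (0, 1)]
dst = [(2, 3), (4, 3), (2, 6)]
print(affine_from_points(src, dst))  # [[2.0, 0.0, 2.0], [0.0, 3.0, 3.0]]
```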

2

I have trained a model on Kaggle at this link: https://www.kaggle.com/dcosmin/shufflenet-with-keras using the source code from this link: https://github.com/opconty/keras-shufflenetV2/blob/maste...
Larrylars asked 29/5, 2020 at 8:10

2

Solved

I have a binary mask for each image, with each mask pixel having a value of either 0 or 255. Now because my image segmentation needs images of fixed size, I will have to resize the images and the mas...
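
The excerpt above is cut off, so the exact question is unknown. A common concern when resizing 0/255 masks is that interpolating resizers introduce intermediate gray values, whereas nearest-neighbour resizing only copies existing pixels. The sketch below shows that property in plain Python; `resize_mask_nearest` is a hypothetical helper using a simple floor index mapping, which is not guaranteed to match any particular library's nearest-neighbour mode (for example OpenCV's `cv2.INTER_NEAREST`) pixel for pixel.

```python
def resize_mask_nearest(mask, new_h, new_w):
    """Resize a 2D mask (list of rows) by copying the nearest source pixel.

    Because values are only copied, never blended, a 0/255 mask stays 0/255.
    """
    old_h, old_w = len(mask), len(mask[0])
    return [
        [mask[(r * old_h) // new_h][(c * old_w) // new_w] for c in range(new_w)]
        for r in range(new_h)
    ]


mask = [[0, 255],
        [255, 0]]
resized = resize_mask_nearest(mask, 4, 4)
for row in resized:
    print(row)
# Every value in the result is still either 0 or 255.
```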

2

Solved

I have some existing PyTorch code with cuda() as below, where net is a MainModel.KitModel object: net = torch.load(model_path) net.cuda() and im = cv2.imread(image_path) im = Variable(torch.from_...

3

Solved

I draw a rectangle on my image using cv2.rectangle(frame,(x,y),(x1,y1),(0,255,0),2). I would like to draw rectangles with text information on them. How do I do it? Are there any ready-to-use implem...
Brine asked 13/5, 2019 at 8:11

2

I used Nvidia's Transfer Learning Toolkit (TLT) to train and then used the tlt-converter to convert the .etlt model into an .engine file. I want to use this .engine file for inference in Python. B...
Viglione asked 11/12, 2019 at 7:29

7

I'm trying to do image classification with the Inception V3 model. Does ImageDataGenerator from Keras create new images which are added onto my dataset? If I have 1000 images, will using this funct...
Microcopy asked 8/8, 2018 at 13:54

3

Solved

I had a chat with an engineer the other day and we both were stumped on a question related to bundle adjustment. For a refresher, here is a good link explaining the problem: http://homepages.inf.e...
Faeroese asked 19/11, 2015 at 20:26

2

Solved

I often get confused about the meaning of the term descriptor in the context of image features. Is a descriptor the description of the local neighborhood of a point (e.g. a float vector), or is a de...

3

I have split my training dataset into 80% train and 20% validation data and created DataLoaders as shown below. However, I do not want to limit my model's training. So I thought of splitting my d...

2

Solved

I'm creating an image classification model with Inception V3 and have two classes. I've split my dataset and labels into two numpy arrays. The data is split with trainX and testY as the images and t...

3

Solved

I have the following directory structure: ~/emsdk ~/opencv I am trying to build OpenCV.js I do cd ~/opencv python ./platforms/js/build_js.py build_js --emscripten_dir=~/emsdk/ Which results...
Duthie asked 7/5, 2019 at 18:6

4

Solved

I'm attempting to implement an easter egg in a mobile app I'm working on. This easter egg will be triggered when a logo is detected in the camera view. The logo I'm trying to detect is this one: ....
Darb asked 17/3, 2012 at 9:41

2

I would like to create a script in Python (using the OpenCV library) that determines which markers are in the picture. Markers look something like this: Markers After loading the image, the scrip...
Garris asked 4/5, 2016 at 20:13

1

Solved

I have written the following web app to perform pose detection on two videos. The idea is to, say, give a benchmark video in the first and a user video (either a pre-recorded one or their webcam fe...
Copt asked 29/5, 2021 at 1:46

4

Solved

While implementing a sliding window in Python to detect objects in still images, I came across a nice function: numpy.lib.stride_tricks.as_strided. So I tried to achieve a general rule to...
Chanell asked 24/9, 2011 at 21:43

3

Solved

I am trying to implement a "Digit Recognition OCR" in OpenCV-Python (cv2). It is just for learning purposes. I would like to learn both KNearest and SVM features in OpenCV. I have 100 samples (i....
Nava asked 23/2, 2012 at 12:37

5

Solved

I'm learning image processing using OpenCV for a realtime application. I did some thresholding on an image and want to label the contours in green, but they aren't showing up in green because my im...
Prosimian asked 6/2, 2014 at 7:8

7

Solved

I'm trying to use OpenCV 2.1 to combine two images into one, with the two images placed adjacent to each other. In Python, I'm doing: import numpy as np, cv img1 = cv.LoadImage(fn1, 0) img2 = cv....
Protestant asked 28/9, 2011 at 20:11
