In my mind they both mean reconstructing 3D coordinates from matched points in 2D images. What's the difference between these concepts and multi-view stereo?
Which of these terms would you use for an algorithm that computes a sparse point cloud from keypoint matches and requires both the cameras' extrinsic and intrinsic parameters to be known a priori?
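
To make the second question concrete, here is a minimal sketch of the kind of algorithm I mean, assuming two pinhole cameras whose internal matrix `K` and external pose `R`, `t` are already known. The function names, the linear least-squares (SVD) solve, and the camera values are my own illustrative assumptions, not taken from any particular library or paper.

```python
import numpy as np

def projection_matrix(K, R, t):
    """Build the 3x4 projection matrix P = K [R | t] from known parameters."""
    return K @ np.hstack([R, t.reshape(3, 1)])

def project(P, X):
    """Project a 3D point X into pixel coordinates with projection matrix P."""
    x = P @ np.append(X, 1.0)
    return x[:2] / x[2]

def reconstruct_point(P1, P2, x1, x2):
    """Recover one 3D point from a matched pixel pair (x1, x2).

    Each view contributes two linear constraints on the homogeneous point;
    the solution is the right singular vector of A with the smallest
    singular value.
    """
    A = np.stack([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]
    return X[:3] / X[3]  # de-homogenize

# Hypothetical setup: same intrinsics, second camera shifted along x.
K = np.array([[800.0, 0.0, 320.0],
              [0.0, 800.0, 240.0],
              [0.0, 0.0, 1.0]])
P1 = projection_matrix(K, np.eye(3), np.zeros(3))
P2 = projection_matrix(K, np.eye(3), np.array([-1.0, 0.0, 0.0]))

# Synthetic "keypoint match": project a known point into both views.
X_true = np.array([0.5, -0.2, 4.0])
x1, x2 = project(P1, X_true), project(P2, X_true)

X_est = reconstruct_point(P1, P2, x1, x2)
print(X_est)
```

Running this over every keypoint match gives one 3D point per match, which is the sparse point cloud I am referring to. Nothing here estimates the camera parameters themselves; they are inputs.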