How to list the files inside a JAR file?
Asked Answered
M

17

152

I have this code which reads all the files from a directory.

    File textFolder = new File("text_directory");

    File [] texFiles = textFolder.listFiles( new FileFilter() {
           public boolean accept( File file ) {
               return file.getName().endsWith(".txt");
           }
    });

It works great. It fills the array with all the files that end with ".txt" from directory "text_directory".

How can I read the contents of a directory in a similar fashion within a JAR file?

So what I really want to do is, to list all the images inside my JAR file, so I can load them with:

ImageIO.read(this.getClass().getResource("CompanyLogo.png"));

(That one works because the "CompanyLogo" is "hardcoded" but the number of images inside the JAR file could be from 10 to 200 variable length.)

EDIT

So I guess my main problem would be: How to know the name of the JAR file where my main class lives?

Granted I could read it using java.util.Zip.

My Structure is like this:

They are like:

my.jar!/Main.class
my.jar!/Aux.class
my.jar!/Other.class
my.jar!/images/image01.png
my.jar!/images/image02a.png
my.jar!/images/imwge034.png
my.jar!/images/imagAe01q.png
my.jar!/META-INF/manifest 

Right now I'm able to load for instance "images/image01.png" using:

    ImageIO.read(this.getClass().getResource("images/image01.png));

But only because I know the file name, for the rest I have to load them dynamically.

Macule answered 15/9, 2009 at 19:28 Comment(4)
Just a thought - why not zip/jar images into a separate file and read the entries in it from your class in another jar?Glossematics
Because it would need an "extra" step for distribution/installation. :( You know, end users.Macule
Given that you have created the jar, you might as well include the list of files within it rather than attempting any tricks.Taiwan
Well, I might be mistaken but jars can be embedded inside other jars. The one-jar(TM) packaging solution ibm.com/developerworks/java/library/j-onejar works on this basis. Except, in your case you do not require the ability load classes.Glossematics
T
111
CodeSource src = MyClass.class.getProtectionDomain().getCodeSource();
if (src != null) {
  URL jar = src.getLocation();
  ZipInputStream zip = new ZipInputStream(jar.openStream());
  while(true) {
    ZipEntry e = zip.getNextEntry();
    if (e == null)
      break;
    String name = e.getName();
    if (name.startsWith("path/to/your/dir/")) {
      /* Do something with this entry. */
      ...
    }
  }
} 
else {
  /* Fail... */
}

Note that in Java 7, you can create a FileSystem from the JAR (zip) file, and then use NIO's directory walking and filtering mechanisms to search through it. This would make it easier to write code that handles JARs and "exploded" directories.

Television answered 15/9, 2009 at 19:48 Comment(8)
hey thanks... been looking for a way to do this for a few hours now !!Quintana
Yes this code works if we want to list all entries inside this jar file. But if I just want to list a subdirectory inside the jar, for example, example.jar/dir1/dir2/, how can I directly list all files inside this subdirectory? Or I need to unzip this jar file? I highly appreciate your help !Reger
Mentioned Java 7 approach is listed in @acheron55's answer.Stoeber
@Stoeber Are you sure acheron55's answer is for Java 7? I didn't found Files.walk() or java.util.Stream in Java 7, but in Java 8: docs.oracle.com/javase/8/docs/api/java/nio/file/Files.htmlDubitation
@BruceSun, in java 7 you can use Files.walkFileTree(...) instead.Stoeber
This only works for extracting class, useless on resources which is most situationDisincentive
@Disincentive That's not correct. This can be used to extract any file in the archive. What gave you your incorrect understanding?Television
Best solution. Solved the problem.Unfrequented
P
106

Code that works for both IDE's and .jar files:

import java.io.*;
import java.net.*;
import java.nio.file.*;
import java.util.*;
import java.util.stream.*;

public class ResourceWalker {
    public static void main(String[] args) throws URISyntaxException, IOException {
        URI uri = ResourceWalker.class.getResource("/resources").toURI();
        Path myPath;
        if (uri.getScheme().equals("jar")) {
            FileSystem fileSystem = FileSystems.newFileSystem(uri, Collections.<String, Object>emptyMap());
            myPath = fileSystem.getPath("/resources");
        } else {
            myPath = Paths.get(uri);
        }
        Stream<Path> walk = Files.walk(myPath, 1);
        for (Iterator<Path> it = walk.iterator(); it.hasNext();){
            System.out.println(it.next());
        }
    }
}
Pinchcock answered 21/1, 2015 at 0:39 Comment(10)
Fantastic!!! but URI uri = MyClass.class.getResource("/resources").toURI(); should have MyClass.class.getClassLoader().getResource("/resources").toURI(); i.e, getClassLoader(). Otherwise it was not working for me.Embank
Don't forget to close fileSystem!Typal
This should be the first answer for 1.8 (walk method in Files is only available in 1.8). The only problem is that the resources directory appears in the Files.walk(myPath, 1), not just the files. I guess the first element can be simply ignoredCardwell
@Cardwell I think it is 1.7, I wasn't using 1.8 when writing that answerPinchcock
This is a good idea, but it is also unsafe. I've added an answer with a safer approach.Ope
I have added a Java 7 implementation of this code and tested it on application servers too. See #1429672Ecosystem
What is the use case for the else path for uri.getScheme().equals("jar")?Malloch
@StefanBormann for if you run it regular (in your IDE) and not in a built jarStator
myPath = fileSystem.getPath("/resources"); doesn't work for me; it doesn't find anything. It should be "images" in my case and the "images"-directory is definitely included in my jar!Stator
@Typal why do we need to close the fileSystem? The default fileSystem is always open.Shatter
M
25

erickson's answer worked perfectly:

Here's the working code.

CodeSource src = MyClass.class.getProtectionDomain().getCodeSource();
List<String> list = new ArrayList<String>();

if( src != null ) {
    URL jar = src.getLocation();
    ZipInputStream zip = new ZipInputStream( jar.openStream());
    ZipEntry ze = null;

    while( ( ze = zip.getNextEntry() ) != null ) {
        String entryName = ze.getName();
        if( entryName.startsWith("images") &&  entryName.endsWith(".png") ) {
            list.add( entryName  );
        }
    }

 }
 webimages = list.toArray( new String[ list.size() ] );

And I have just modify my load method from this:

File[] webimages = ... 
BufferedImage image = ImageIO.read(this.getClass().getResource(webimages[nextIndex].getName() ));

To this:

String  [] webimages = ...

BufferedImage image = ImageIO.read(this.getClass().getResource(webimages[nextIndex]));
Macule answered 16/9, 2009 at 21:41 Comment(0)
O
16

I would like to expand on acheron55's answer, since it is a very non-safe solution, for several reasons:

  1. It doesn't close the FileSystem object.
  2. It doesn't check if the FileSystem object already exists.
  3. It isn't thread-safe.

This is somewhat a safer solution:

private static ConcurrentMap<String, Object> locks = new ConcurrentHashMap<>();

public void walk(String path) throws Exception {

    URI uri = getClass().getResource(path).toURI();
    if ("jar".equals(uri.getScheme())) {
        safeWalkJar(path, uri);
    } else {
        Files.walk(Paths.get(path));
    }
}

private void safeWalkJar(String path, URI uri) throws Exception {

    synchronized (getLock(uri)) {    
        // this'll close the FileSystem object at the end
        try (FileSystem fs = getFileSystem(uri)) {
            Files.walk(fs.getPath(path));
        }
    }
}

private Object getLock(URI uri) {

    String fileName = parseFileName(uri);  
    locks.computeIfAbsent(fileName, s -> new Object());
    return locks.get(fileName);
}

private String parseFileName(URI uri) {
    
    String schemeSpecificPart = uri.getSchemeSpecificPart();
    return schemeSpecificPart.substring(0, schemeSpecificPart.indexOf("!"));
}

private FileSystem getFileSystem(URI uri) throws IOException {

    try {
        return FileSystems.getFileSystem(uri);
    } catch (FileSystemNotFoundException e) {
        return FileSystems.newFileSystem(uri, Collections.<String, String>emptyMap());
    }
}   

There's no real need to synchronize over the file name; one could simply synchronize on the same object every time (or make the method synchronized), it's purely an optimization.

I would say that this is still a problematic solution, since there might be other parts in the code that use the FileSystem interface over the same files, and it could interfere with them (even in a single threaded application).
Also, it doesn't check for nulls (for instance, on getClass().getResource().

This particular Java NIO interface is kind of horrible, since it introduces a global/singleton non thread-safe resource, and its documentation is extremely vague (a lot of unknowns due to provider specific implementations). Results may vary for other FileSystem providers (not JAR). Maybe there's a good reason for it being that way; I don't know, I haven't researched the implementations.

Ope answered 9/6, 2016 at 15:54 Comment(2)
Synchronization of external resources, like FS, does not have much sense within one VM. There can be other applications accessing it outside your VM. Besides even inside your own application, your locking based on filenames can be easily bypassed. With this things it is better to rely on OS synchronization mechanisms, like file locking.Scaly
@Scaly The file-name lock mechanism could totally be bypassed; my answer is not safe enough as well, but I believe it's the most you can get with Java NIO's with minimal effort. Relying on the OS to manage the locks, or not being in control of which applications access which files is a bad practice IMHO, unless you're building a costumer-based app -- say, a text editor. Not managing locks on your own will either cause exceptions to be thrown, or cause threads to block the application - both should be avoided.Ope
S
9

So I guess my main problem would be, how to know the name of the jar where my main class lives.

Assuming that your project is packed in a Jar (not necessarily true!), you can use ClassLoader.getResource() or findResource() with the class name (followed by .class) to get the jar that contains a given class. You'll have to parse the jar name from the URL that gets returned (not that tough), which I will leave as an exercise for the reader :-)

Be sure to test for the case where the class is not part of a jar.

Sense answered 16/9, 2009 at 6:41 Comment(4)
huh - interesting that this would have been down moded without comment... We use the above technique all the time and it works just fine.Sense
An old issue, but to me this seems like a fine hack. Upvoted back to zero :)Sissified
Upvoted, because this is the only solution listed here for the case when a class does not have a CodeSource.Fredericfrederica
Need to remember that a classloader's getResources will search for all classloader paths, so it'll include resources from other jars. So, better to place own resources into uniquely named folder hierarchy.Hedelman
E
9

I've ported acheron55's answer to Java 7 and closed the FileSystem object. This code works in IDE's, in jar files and in a jar inside a war on Tomcat 7; but note that it does not work in a jar inside a war on JBoss 7 (it gives FileSystemNotFoundException: Provider "vfs" not installed, see also this post). Furthermore, like the original code, it is not thread safe, as suggested by errr. For these reasons I have abandoned this solution; however, if you can accept these issues, here is my ready-made code:

import java.io.IOException;
import java.net.*;
import java.nio.file.*;
import java.nio.file.attribute.BasicFileAttributes;
import java.util.Collections;

public class ResourceWalker {

    public static void main(String[] args) throws URISyntaxException, IOException {
        URI uri = ResourceWalker.class.getResource("/resources").toURI();
        System.out.println("Starting from: " + uri);
        try (FileSystem fileSystem = (uri.getScheme().equals("jar") ? FileSystems.newFileSystem(uri, Collections.<String, Object>emptyMap()) : null)) {
            Path myPath = Paths.get(uri);
            Files.walkFileTree(myPath, new SimpleFileVisitor<Path>() { 
                @Override
                public FileVisitResult visitFile(Path file, BasicFileAttributes attrs) throws IOException {
                    System.out.println(file);
                    return FileVisitResult.CONTINUE;
                }
            });
        }
    }
}
Ecosystem answered 11/10, 2016 at 9:38 Comment(1)
This code doesn't work for me, I get NullPointerException on line with ResourceWalker.class.getResource("/resources").toURI(), and if I use "" as parameter, I get the following error: Starting from: rsrc:com/betalord/sgx/util/ java.nio.file.FileSystemNotFoundException: Provider "rsrc" not installedSchapira
M
7

Just to mention that if you are already using Spring, you can take advantage of the PathMatchingResourcePatternResolver.

For instance to get all the PNG files from a images folder in resources

ClassLoader cl = this.getClass().getClassLoader(); 
ResourcePatternResolver resolver = new PathMatchingResourcePatternResolver(cl);
Resource[] resources = resolver.getResources("images/*.png");
for (Resource r: resources){
    logger.info(r.getFilename());
    // From your example
    // ImageIO.read(cl.getResource("images/" + r.getFilename()));
}
Mansoor answered 29/6, 2021 at 13:37 Comment(2)
simple and sweet, worked great within a jar in a dockerDupe
You are great man. All next generations should to remember and respect to you. Thank you very mutch.Peachy
S
6

Here is an example of using Reflections library to recursively scan classpath by regex name pattern augmented with a couple of Guava perks to to fetch resources contents:

Reflections reflections = new Reflections("com.example.package", new ResourcesScanner());
Set<String> paths = reflections.getResources(Pattern.compile(".*\\.template$"));

Map<String, String> templates = new LinkedHashMap<>();
for (String path : paths) {
    log.info("Found " + path);
    String templateName = Files.getNameWithoutExtension(path);
    URL resource = getClass().getClassLoader().getResource(path);
    String text = Resources.toString(resource, StandardCharsets.UTF_8);
    templates.put(templateName, text);
}

This works with both jars and exploded classes.

Stoeber answered 3/8, 2015 at 10:3 Comment(1)
Beware that reflections still doesn't support Java 9 and above: github.com/ronmamo/reflections/issues/186. There are links to competing libraries there.Stoeber
D
5

Here's a method I wrote for a "run all JUnits under a package". You should be able to adapt it to your needs.

private static void findClassesInJar(List<String> classFiles, String path) throws IOException {
    final String[] parts = path.split("\\Q.jar\\\\E");
    if (parts.length == 2) {
        String jarFilename = parts[0] + ".jar";
        String relativePath = parts[1].replace(File.separatorChar, '/');
        JarFile jarFile = new JarFile(jarFilename);
        final Enumeration<JarEntry> entries = jarFile.entries();
        while (entries.hasMoreElements()) {
            final JarEntry entry = entries.nextElement();
            final String entryName = entry.getName();
            if (entryName.startsWith(relativePath)) {
                classFiles.add(entryName.replace('/', File.separatorChar));
            }
        }
    }
}

Edit: Ah, in that case, you might want this snippet as well (same use case :) )

private static File findClassesDir(Class<?> clazz) {
    try {
        String path = clazz.getProtectionDomain().getCodeSource().getLocation().getFile();
        final String codeSourcePath = URLDecoder.decode(path, "UTF-8");
        final String thisClassPath = new File(codeSourcePath, clazz.getPackage().getName().repalce('.', File.separatorChar));
    } catch (UnsupportedEncodingException e) {
        throw new AssertionError("impossible", e);
    }
}
Dombrowski answered 15/9, 2009 at 19:39 Comment(1)
I guess the big problem is to know the jar file name in first place. It is the jar where the Main-Class: lives.Macule
B
3

A jar file is just a zip file with a structured manifest. You can open the jar file with the usual java zip tools and scan the file contents that way, inflate streams, etc. Then use that in a getResourceAsStream call, and it should be all hunky dory.

EDIT / after clarification

It took me a minute to remember all the bits and pieces and I'm sure there are cleaner ways to do it, but I wanted to see that I wasn't crazy. In my project image.jpg is a file in some part of the main jar file. I get the class loader of the main class (SomeClass is the entry point) and use it to discover the image.jpg resource. Then some stream magic to get it into this ImageInputStream thing and everything is fine.

InputStream inputStream = SomeClass.class.getClassLoader().getResourceAsStream("image.jpg");
JPEGImageReaderSpi imageReaderSpi = new JPEGImageReaderSpi();
ImageReader ir = imageReaderSpi.createReaderInstance();
ImageInputStream iis = new MemoryCacheImageInputStream(inputStream);
ir.setInput(iis);
....
ir.read(0); //will hand us a buffered image
Buckeen answered 15/9, 2009 at 19:35 Comment(4)
This jar contains the main program and the resources. How do I refer to the self jar? from within the jar file?Macule
To refer to the JAR file, just use "blah.JAR" as the String. You can use new File("blah.JAR") to create a File object that represents the JAR, for example. Just replace "blah.JAR" with the name of your JAR.Octavo
If its the same jar that you are already running out of, the class loader should be able to see stuff inside the jar ... I misunderstood what you were trying to do initially.Buckeen
Well yes, I already have that, the problem is when I need something like: "...getResourceAsStream("*.jpg"); ... " That is, dynamically, list the files contained.Macule
H
3

Given an actual JAR file, you can list the contents using JarFile.entries(). You will need to know the location of the JAR file though - you can't just ask the classloader to list everything it could get at.

You should be able to work out the location of the JAR file based on the URL returned from ThisClassName.class.getResource("ThisClassName.class"), but it may be a tiny bit fiddly.

Horrific answered 15/9, 2009 at 19:37 Comment(6)
Reading your answer another question raised. What would yield the call: this.getClass().getResource("/my_directory"); It should return an URL that could in turn be .... used as directory? Nahh... let me try it.Macule
You always know the location of the JAR - it's in ".". As long as the name of the JAR is known to be something, you can use a String constant somewhere. Now, if people go changing the name of the JAR...Octavo
@Thomas: That's assuming you're running the app from the current directory. What's wrong with "java -jar foo/bar/baz.jar"?Horrific
I believe (and would have to verify), that if you had code in your Jar that was new File("baz.jar), the File object would represent your JAR file.Octavo
@Thomas: I don't believe so. I believe it's would be relative to the current working directory of the process. I'd have to check too though :)Horrific
One liner for this solution: new JarFile(new File(ThisClassName.class.getProtectionDomain().getCodeSource().getLocation().toURI())).entries()Matador
T
3

Some time ago I made a function that gets classess from inside JAR:

public static Class[] getClasses(String packageName) 
throws ClassNotFoundException{
    ArrayList<Class> classes = new ArrayList<Class> ();

    packageName = packageName.replaceAll("\\." , "/");
    File f = new File(jarName);
    if(f.exists()){
        try{
            JarInputStream jarFile = new JarInputStream(
                    new FileInputStream (jarName));
            JarEntry jarEntry;

            while(true) {
                jarEntry=jarFile.getNextJarEntry ();
                if(jarEntry == null){
                    break;
                }
                if((jarEntry.getName ().startsWith (packageName)) &&
                        (jarEntry.getName ().endsWith (".class")) ) {
                    classes.add(Class.forName(jarEntry.getName().
                            replaceAll("/", "\\.").
                            substring(0, jarEntry.getName().length() - 6)));
                }
            }
        }
        catch( Exception e){
            e.printStackTrace ();
        }
        Class[] classesA = new Class[classes.size()];
        classes.toArray(classesA);
        return classesA;
    }else
        return null;
}
Thereby answered 27/5, 2011 at 23:28 Comment(0)
S
2
public static ArrayList<String> listItems(String path) throws Exception{
    InputStream in = ClassLoader.getSystemClassLoader().getResourceAsStream(path);
    byte[] b = new byte[in.available()];
    in.read(b);
    String data = new String(b);
    String[] s = data.split("\n");
    List<String> a = Arrays.asList(s);
    ArrayList<String> m = new ArrayList<>(a);
    return m;
}
Screening answered 24/8, 2019 at 19:23 Comment(2)
While this code snippet may solve the problem, it doesn't explain why or how it answers the question. Please include an explanation for your code, as that really helps to improve the quality of your post. Remember that you are answering the question for readers in the future, and those people might not know the reasons for your code suggestion.Dissert
data is empty when we execute the code from a jar file.Earleneearley
S
1

There are two very useful utilities both called JarScan:

  1. www.inetfeedback.com/jarscan

  2. jarscan.dev.java.net

See also this question: JarScan, scan all JAR files in all subfolders for specific class

Sulfonmethane answered 5/10, 2009 at 7:21 Comment(0)
S
1

The most robust mechanism for listing all resources in the classpath is currently to use this pattern with ClassGraph, because it handles the widest possible array of classpath specification mechanisms, including the new JPMS module system. (I am the author of ClassGraph.)

How to know the name of the JAR file where my main class lives?

URI mainClasspathElementURI;
try (ScanResult scanResult = new ClassGraph().whitelistPackages("x.y.z")
        .enableClassInfo().scan()) {
    mainClasspathElementURI =
            scanResult.getClassInfo("x.y.z.MainClass").getClasspathElementURI();
}

How can I read the contents of a directory in a similar fashion within a JAR file?

List<String> classpathElementResourcePaths;
try (ScanResult scanResult = new ClassGraph().overrideClasspath(mainClasspathElementURI)
        .scan()) {
    classpathElementResourcePaths = scanResult.getAllResources().getPaths();
}

There are lots of other ways to deal with resources too.

Sukin answered 5/10, 2019 at 11:20 Comment(0)
C
1

One more for the road that's a bit more flexible for matching specific filenames because it uses wildcard globbing. In a functional style this could resemble:

import java.io.IOException;
import java.net.URISyntaxException;
import java.nio.file.FileSystem;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.function.Consumer;

import static java.nio.file.FileSystems.getDefault;
import static java.nio.file.FileSystems.newFileSystem;
import static java.util.Collections.emptyMap;

/**
 * Responsible for finding file resources.
 */
public class ResourceWalker {
  /**
   * Globbing pattern to match font names.
   */
  public static final String GLOB_FONTS = "**.{ttf,otf}";

  /**
   * @param directory The root directory to scan for files matching the glob.
   * @param c         The consumer function to call for each matching path
   *                  found.
   * @throws URISyntaxException Could not convert the resource to a URI.
   * @throws IOException        Could not walk the tree.
   */
  public static void walk(
    final String directory, final String glob, final Consumer<Path> c )
    throws URISyntaxException, IOException {
    final var resource = ResourceWalker.class.getResource( directory );
    final var matcher = getDefault().getPathMatcher( "glob:" + glob );

    if( resource != null ) {
      final var uri = resource.toURI();
      final Path path;
      FileSystem fs = null;

      if( "jar".equals( uri.getScheme() ) ) {
        fs = newFileSystem( uri, emptyMap() );
        path = fs.getPath( directory );
      }
      else {
        path = Paths.get( uri );
      }

      try( final var walk = Files.walk( path, 10 ) ) {
        for( final var it = walk.iterator(); it.hasNext(); ) {
          final Path p = it.next();
          if( matcher.matches( p ) ) {
            c.accept( p );
          }
        }
      } finally {
        if( fs != null ) { fs.close(); }
      }
    }
  }
}

Consider parameterizing the file extensions, left an exercise for the reader.

Be careful with Files.walk. According to the documentation:

This method must be used within a try-with-resources statement or similar control structure to ensure that the stream's open directories are closed promptly after the stream's operations have completed.

Likewise, newFileSystem must be closed, but not before the walker has had a chance to visit the file system paths.

Ceremonious answered 23/6, 2020 at 6:26 Comment(0)
S
0

Just a different way of listing/reading files from a jar URL and it does it recursively for nested jars

https://gist.github.com/trung/2cd90faab7f75b3bcbaa

URL urlResource = Thead.currentThread().getContextClassLoader().getResource("foo");
JarReader.read(urlResource, new InputStreamCallback() {
    @Override
    public void onFile(String name, InputStream is) throws IOException {
        // got file name and content stream 
    }
});
Speedboat answered 17/6, 2015 at 3:0 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.