I'm trying to write a build.rs script that creates an up-to-date HashMap mapping the first 6 characters of a MAC address to its corresponding vendor.

It has 29231 key-value pairs, which causes cargo check to spend more than 7 minutes on my source code. Before this, it took less than 20 seconds. It also uses all 8 GB of the RAM available on my laptop, and I cannot use the machine during those 7-8 minutes.

I think this is either a rustc/cargo bug, or I am doing something wrong, and I'm pretty sure it is the latter. What is the correct way of generating code like this?
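
main.rs

    use std::collections::{HashMap, HashSet};
    use std::hash::BuildHasherDefault; // needed for the CustomHasher alias below

    use rustc_hash::{FxHashMap, FxHashSet, FxHasher};

    type CustomHasher = BuildHasherDefault<FxHasher>;

    // Pull in the table generated by build.rs.
    include!(concat!(env!("OUT_DIR"), "/map_oui.rs"));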

map_oui.rs

    #[rustfmt::skip]
    lazy_static! {
        static ref MAP_MACS: FxHashMap<&'static [u8; 6], &'static str> = {
            let mut map_macs = HashMap::with_capacity_and_hasher(29231, CustomHasher::default());
            map_macs.insert(b"002272", "American Micro-Fuel Device Corp.");
            map_macs.insert(b"00D0EF", "IGT");
            //...

build.rs

    use std::env;
    use std::fs::File;
    use std::io::prelude::*;
    use std::io::{BufReader, BufWriter};
    use std::path::Path;

    fn main() {
        let out_dir = env::var_os("OUT_DIR").unwrap();
        let dest_path = Path::new(&out_dir).join("map_oui.rs");
        let handle = File::create(dest_path).unwrap();
        let mut writer = BufWriter::new(handle);

        // Download the IEEE OUI registry at build time.
        let response = ureq::get("http://standards-oui.ieee.org/oui.txt")
            .call()
            .expect("Connection error");
        let mut reader = BufReader::new(response.into_reader());
        let mut line = Vec::new();

        // Header of the generated file. write_all is used throughout because
        // write may perform a partial write.
        writer
            .write_all(
                b"#[rustfmt::skip]
    lazy_static! {
        static ref MAP_MACS: FxHashMap<&'static [u8; 6], &'static str> = {
            let mut map_macs = HashMap::with_capacity_and_hasher(29231, CustomHasher::default());\n",
            )
            .unwrap();

        loop {
            match reader.read_until(b'\n', &mut line) {
                Ok(bytes_read) => {
                    if bytes_read == 0 {
                        break; // end of input
                    }
                    // Only the "(base 16)" lines carry the 6-character OUI.
                    if line.get(12..=18).map_or(false, |s| s == b"base 16") {
                        let mac_oui = String::from_utf8_lossy(&line[0..6]);
                        // get() avoids a panic on a line with no vendor field.
                        let vendor = String::from_utf8_lossy(line.get(22..).unwrap_or(&[]));
                        // Note: the vendor name is written unescaped, so a quote or
                        // backslash in it would break the generated file.
                        writer.write_all(b"        map_macs.insert(b\"").unwrap();
                        writer.write_all(mac_oui.as_bytes()).unwrap();
                        writer.write_all(b"\", \"").unwrap();
                        writer.write_all(vendor.trim().as_bytes()).unwrap();
                        writer.write_all(b"\");\n").unwrap();
                    }
                    line.clear();
                }
                // Fail the build instead of looping forever on a read error.
                Err(err) => panic!("failed to read the OUI list: {}", err),
            }
        }

        // Footer of the generated file.
        writer
            .write_all(
                b"        map_macs
        };
    }
    ",
            )
            .unwrap();
        writer.flush().unwrap();
        println!("cargo:rerun-if-changed=build.rs");
    }
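
For comparison, here is a minimal sketch of how the generated file could be one sorted array literal instead of tens of thousands of insert calls, which is the direction the comments point to. The names render_table and OUI_TABLE are hypothetical, and the sketch assumes every key is exactly six ASCII characters; it is not the code the asker ended up with.

```rust
use std::fmt::Write;

/// Renders `entries` (OUI, vendor) as the source text of a sorted static array.
/// `render_table` and `OUI_TABLE` are hypothetical names used only in this sketch.
/// Assumes every OUI is exactly six ASCII characters.
fn render_table(entries: &mut Vec<(String, String)>) -> String {
    // Sort by key so the consumer can binary-search, then drop duplicate keys.
    entries.sort_by(|a, b| a.0.cmp(&b.0));
    entries.dedup_by(|a, b| a.0 == b.0);

    let mut out = String::new();
    writeln!(
        out,
        "static OUI_TABLE: [([u8; 6], &str); {}] = [",
        entries.len()
    )
    .unwrap();
    for (oui, vendor) in entries.iter() {
        // `{:?}` on a string emits a quoted, escaped literal, so a quote or
        // backslash in a vendor name cannot break the generated file.
        writeln!(out, "    (*b{:?}, {:?}),", oui, vendor).unwrap();
    }
    out.push_str("];\n");
    out
}
```

In build.rs, the parsing loop would push each (OUI, vendor) pair into a Vec and write the returned string to map_oui.rs once at the end. The array length is computed from the data, so no hard-coded count is needed.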

[(key, value)]. Another workaround would be to move all of this to a different crate completely; that way it should be built less frequently. – Sforza

rust-phf seemed promising but it is slightly slower. I followed @Becalmed's and @Sforza's suggestions and it worked. Currently build.rs generates a const MAP_MACS: [([u8; 6], &str); 29246] and I wrote a wrapper function called vendor_lookup around a binary search of the array. Should I post the code as an answer for future reference? – Archibaldo
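
The approach described in the last comment might look roughly like the sketch below. The names MAP_MACS and vendor_lookup come from the comment, but the body is an assumption, not Archibaldo's actual code. The table here holds only the two entries shown in the question, and it is declared static rather than const so that a large table exists once in the binary.

```rust
/// A two-entry stand-in for the generated table. The real table would be
/// produced by build.rs and must be sorted by key for the search to be valid.
static MAP_MACS: [([u8; 6], &str); 2] = [
    (*b"002272", "American Micro-Fuel Device Corp."),
    (*b"00D0EF", "IGT"),
];

/// Looks up the vendor for a 6-character OUI with a binary search.
/// Returns None when the OUI is not in the table.
fn vendor_lookup(oui: &[u8; 6]) -> Option<&'static str> {
    MAP_MACS
        // Compare each stored key against the requested OUI.
        .binary_search_by(|(key, _)| key.cmp(oui))
        .ok()
        .map(|idx| MAP_MACS[idx].1)
}
```

For example, vendor_lookup(b"00D0EF") returns Some("IGT"), and an OUI that is not in the table returns None. An array literal like this is plain data, so the compiler should have much less to type-check and optimize than with one method call per entry.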