Maximum number of rows in an MS Access database engine table?
Asked Answered
S

8

16

We know the MS Access database engine is 'throttled' to allow a maximum file size of 2GB (or perhaps internally wired to be limited to fewer than some power of 2 of 4KB data pages). But what does this mean in practical terms?

To help me measure this, can you tell me the maximum number of rows that can be inserted into a MS Access database engine table?

To satisfy the definition of a table, all rows must be unique, therefore a unique constraint (e.g. PRIMARY KEY, UNIQUE, CHECK, Data Macro, etc) is a requirement.

EDIT: I realize there is a theoretical limit but what I am interested in is the practical (and not necessarily practicable), real life limit.

Sheepfold answered 3/8, 2009 at 9:35 Comment(3)
It will be possible to insert duplicate rows updating the rpimary key value of a record.Loveinidleness
@Ralph Rickenbach: thanks, I hadn't thought of that (doh!) I've edited the question to make a unique constraint a requirement.Sheepfold
I don't think it's correct to say Jet/ACE is "throttled" insomuch as it's probably internally wired to be limited to fewer than some power of 2 of 4KB data pages (I'm not getting any nice round numbers trying this out, so there must be some overhead in there somewhere). While it might not be difficult to change, it wouldn't serve the interests of MS's larger product lines, particularly given that they really do THROTTLE SQL Server Express at 4GBs -- we know it's the same engine as full SQL Server, but with an artificial limiation.Singer
S
13

Here's my attempt:

I created a single-column (INTEGER) table with no key:

CREATE TABLE a (a INTEGER NOT NULL);

Inserted integers in sequence starting at 1.

I stopped it (arbitrarily after many hours) when it had inserted 65,632,875 rows. The file size was 1,029,772 KB.

I compacted the file which reduced it very slightly to 1,029,704 KB.

I added a PK:

ALTER TABLE a ADD CONSTRAINT p PRIMARY KEY (a);

which increased the file size to 1,467,708 KB.

This suggests the maximum is somewhere around the 80 million mark.

Sheepfold answered 6/8, 2009 at 9:43 Comment(0)
S
14

Some comments:

  1. Jet/ACE files are organized in data pages, which means there is a certain amount of slack space when your record boundaries are not aligned with your data pages.

  2. Row-level locking will greatly reduce the number of possible records, since it forces one record per data page.

  3. In Jet 4, the data page size was increased to 4KBs (from 2KBs in Jet 3.x). As Jet 4 was the first Jet version to support Unicode, this meant that you could store 1GB of double-byte data (i.e., 1,000,000,000 double-byte characters), and with Unicode compression turned on, 2GBs of data. So, the number of records is going to be affected by whether or not you have Unicode compression on.

  4. Since we don't know how much room in a Jet/ACE file is taken up by headers and other metadata, nor precisely how much room index storage takes, the theoretical calculation is always going to be under what is practical.

  5. To get the most efficient possible storage, you'd want to use code to create your database rather than the Access UI, because Access creates certain properties that pure Jet does not need. This is not to say there are a lot of these, as properties set to the Access defaults are usually not set at all (the property is created only when you change it from the default value -- this can be seen by cycling through a field's properties collection, i.e., many of the properties listed for a field in the Access table designer are not there in the properties collection because they haven't been set), but you might want to limit yourself to Jet-specific data types (hyperlink fields are Access-only, for instance).

I just wasted an hour mucking around with this using Rnd() to populate 4 fields defined as type byte, with composite PK on the four fields, and it took forever to append enough records to get up to any significant portion of 2GBs. At over 2 million records, the file was under 80MBs. I finally quit after reaching just 700K 7 MILLION records and the file compacted to 184MBs. The amount of time it would take to get up near 2GBs is just more than I'm willing to invest!

Singer answered 3/8, 2009 at 21:32 Comment(5)
Then length of time to get the test MDB up is probably because of the time spent indexing the composite key. I wonder if it would've been a lot faster just using a auto number key.Vitiate
Actually, I'm pretty sure the reason the last batch took so long was because there were 9000+ dupes to be discarded (appending 2 million records). What I did was stop using Rnd() and used the table as source, just swapping the columns. This did, of course, result in some duplicates, and I'm certain that's what caused it to take over an hour to append those records.Singer
BTW, I'm right now copying this over to an older PC to finish appending up to 2GBs to see how many records it comes out with. That way I can let it run without it bogging down the machine I'm working on (and it was pretty doggy yesterday when I was running it).Singer
Just an update: I'm now up to over 16 million records, and the file is just over 420MBs. I'm starting one more round of appending random data. The last batch had 2 million dupes discarded, so I expect more discards on this one.Singer
I'm stopping for the night. The database now has 21.7 million records (11 million of the last batch were dupes and rejected), and after a compact is just over 600MBs. My method of generating unique records is obviously less effective than it was before, so I think I'll stop now.Singer
S
13

Here's my attempt:

I created a single-column (INTEGER) table with no key:

CREATE TABLE a (a INTEGER NOT NULL);

Inserted integers in sequence starting at 1.

I stopped it (arbitrarily after many hours) when it had inserted 65,632,875 rows. The file size was 1,029,772 KB.

I compacted the file which reduced it very slightly to 1,029,704 KB.

I added a PK:

ALTER TABLE a ADD CONSTRAINT p PRIMARY KEY (a);

which increased the file size to 1,467,708 KB.

This suggests the maximum is somewhere around the 80 million mark.

Sheepfold answered 6/8, 2009 at 9:43 Comment(0)
V
5

As others have stated it's combination of your schema and the number of indexes.

A friend had about 100,000,000 historical stock prices, daily closing quotes, in an MDB which approached the 2 Gb limit.

He pulled them down using some code found in a Microsoft Knowledge base article. I was rather surprised that whatever server he was using didn't cut him off after the first 100K records.

He could view any record in under a second.

Vitiate answered 3/8, 2009 at 17:3 Comment(3)
I would say that to get maximum space, you'd also want to create your database in code, so it's pure Jet, without any of the unnecessary Access-specific features.Singer
@Tony Toews: this question is about practical limits. No anecdotes, thanks.Sheepfold
...but a link to that Microsoft Knowledge base article could get you some rep :)Sheepfold
A
2

It's been some years since I last worked with Access but larger database files always used to have more problems and be more prone to corruption than smaller files.

Unless the database file is only being accessed by one person or stored on a robust network you may find this is a problem before the 2GB database size limit is reached.

Annexation answered 3/8, 2009 at 14:57 Comment(3)
-1 Using another SQL product will not help me find the practical limit of an Access database engine table.Sheepfold
I didn't realise I was advocating the use of another SQL product in my comment? I was pointing out that an important factor in determining the practical limit (whatever that might mean) of the size of an Access database is a network's ability to both serve a large file and serve this large file to multiple users concurrently.Annexation
A properly-designed front-end application won't be retrieving any more data for a Jet/ACE back end with 1 million records than for one with 1000 records because Jet/ACE retrieves the index pages for the requested data set, then only the data pages storing those records. This will be exactly the same amount of data in either context. It's only if you're doing full table scans (because you're an idiot) that your scenario would be relevant.Singer
S
1

We're not necessarily talking theoretical limits here, we're talking about real world limits of the 2GB max file size AND database schema.

  • Is your db a single table or multiple?
  • How many columns does each table have?
  • What are the datatypes?

The schema is on even footing with the row count in determining how many rows you can have.

We have used Access MDBs to store exports of MS-SQL data for statistical analysis by some of our corporate users. In those cases we've exported our core table structure, typically four tables with 20 to 150 columns varying from a hundred bytes per row to upwards of 8000 bytes per row. In these cases, we would bump up against a few hundred thousand rows of data were permissible PER MDB that we would ship them.

So, I just don't think that this question has an answer in absence of your schema.

Spiculum answered 3/8, 2009 at 14:47 Comment(4)
I don't have a schema. The challenge is for you to come up with a schema that will yield the table with the most rows possible (hint: I'm thinking a sole single-column table).Sheepfold
Row-level locking also makes a difference, as it forces a single data page for each record.Singer
Yes, the answer will be in the millions and will take a long time to achieve. I would be surprised if no one has done this before :)Sheepfold
It is confusing, I know (see my edits to the question!), but I mean 'practical' meaning 'can be done' and not 'practicable' meaning 'worth doing'. However, I think it is worth knowing the practical limit because I suspect it would give some indication that the practicable limit exceeds the theoretical limit. Any clearer? :)Sheepfold
G
0

It all depends. Theoretically using a single column with 4 byte data type. You could store 300 000 rows. But there is probably alot of overhead in the database even before you do anything. I read some where that you could have 1.000.000 rows but again, it all depends..

You can also link databases together. Limiting yourself to only disk space.

Gonfalon answered 3/8, 2009 at 9:57 Comment(2)
The math of 4-byte records and 300K rows does not add up at all. I've had databases in long-term production use with over 300K rows in 3 tables (and one table with twice that) and the records were a lot more complicated than a single 4-byte field. The data file was substantially less than 500MBs in size (more like half that).Singer
yes you're right. Im missing some zeroes. (2 gigabytes) / (4 Bytes) = 536 870 912Gonfalon
O
0

Practical = 'useful in practice' - so the best you're going to get is anecdotal. Everything else is just prototyping and testing results.

I agree with others - determining 'a max quantity of records' is completely dependent on schema - # tables, # fields, # indexes.

Another anecdote for you: I recently hit 1.6GB file size with 2 primary data stores (tables), of 36 and 85 fields respectively, with some subset copies in 3 additional tables.

Who cares if data is unique or not - only material if context says it is. Data is data is data, unless duplication affects handling by the indexer.

The total row counts making up that 1.6GB is 1.72M.

Otolaryngology answered 16/5, 2012 at 21:44 Comment(1)
I'll let you in on a secret: this was just meant to be a fun challenge :) By 'practical' I mean in practice, as opposed to in theory, so anecdotes aren't what I am after, thanks.Sheepfold
M
0

When working with 4 large Db2 tables I have not only found the limit but it caused me to look really bad to a boss who thought that I could append all four tables (each with over 900,000 rows) to one large table. the real life result was that regardless of how many times I tried the Table (which had exactly 34 columns - 30 text and 3 integer) would spit out some cryptic message "Cannot open database unrecognized format or the file may be corrupted". Bottom Line is Less than 1,500,000 records and just a bit more than 1,252,000 with 34 rows.

Mephitic answered 8/6, 2014 at 6:48 Comment(0)

© 2022 - 2025 — McMap. All rights reserved.