Hash Tables

S1ght · Oct 23, 2008

Hi all,

I'm having some trouble understanding hash tables and collision handling with random access files. I understand how to hash a key to get the record number in the file and then how to turn that into a position in the file, so I can read and write to random access files as long as there are no collisions.
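For anyone following along, the collision-free case described above might look roughly like this. It is only a sketch in Python rather than Visual Basic, and the record layout (a 4-byte integer key plus a 20-byte name), the table size of 13 slots and all function names are assumptions made for illustration, not details from the post.

```python
import io
import struct

# Hypothetical fixed-length record: 4-byte int key + 20-byte name.
RECORD_FORMAT = "<i20s"
RECORD_SIZE = struct.calcsize(RECORD_FORMAT)   # 24 bytes
NUM_SLOTS = 13                                 # assumed number of record slots


def make_empty_file():
    """In-memory stand-in for a pre-allocated random access file."""
    return io.BytesIO(bytes(NUM_SLOTS * RECORD_SIZE))


def record_number(key: int) -> int:
    """Division-method hash: key modulo the number of slots."""
    return key % NUM_SLOTS


def byte_offset(slot: int) -> int:
    """Byte position of a 0-based record number in the file."""
    return slot * RECORD_SIZE


def write_record(f, key: int, name: str) -> int:
    """Write the record at its hashed slot (ignores collisions)."""
    slot = record_number(key)
    f.seek(byte_offset(slot))
    f.write(struct.pack(RECORD_FORMAT, key, name.encode("ascii")))
    return slot


def read_record(f, key: int):
    """Read back whatever record sits at the key's hashed slot."""
    f.seek(byte_offset(record_number(key)))
    stored_key, name = struct.unpack(RECORD_FORMAT, f.read(RECORD_SIZE))
    return stored_key, name.rstrip(b"\x00").decode("ascii")
```

With these assumptions, key 27 hashes to record number 1 and therefore to byte offset 24. A second key such as 14 would hash to the same slot and silently overwrite it, which is exactly the collision problem the rest of the thread is about.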

Where I have problems is when two keys share the same record number in the file. I've read a lot online about hash tables but still can't grasp it completely.

I'm trying to implement it using bucket addressing with a bucket size of 5. There are 13 records in the file and I am using Visual Basic.

So far I've gotten as far as declaring a variable of type hashfile and I have an array of records where the record has the key value in it...
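As a rough illustration of bucket addressing with a bucket size of 5, here is a small in-memory sketch in Python. The choice of 3 buckets (15 slots for 13 records), the rule that a full bucket overflows into the next bucket, and every name below are assumptions for the example; an assignment may well specify a separate overflow area instead.

```python
BUCKET_SIZE = 5    # from the post
NUM_BUCKETS = 3    # assumption: 3 buckets * 5 slots = 15 slots for 13 records


def make_table():
    """Each bucket is a list of BUCKET_SIZE slots; None marks an empty slot."""
    return [[None] * BUCKET_SIZE for _ in range(NUM_BUCKETS)]


def file_record_number(bucket: int, slot: int) -> int:
    """Record number a (bucket, slot) pair would map to in a flat file."""
    return bucket * BUCKET_SIZE + slot


def insert(table, key, value):
    """Store (key, value) in the home bucket, or the next bucket with room."""
    home = key % NUM_BUCKETS
    for step in range(NUM_BUCKETS):
        b = (home + step) % NUM_BUCKETS
        for s in range(BUCKET_SIZE):
            entry = table[b][s]
            if entry is None or entry[0] == key:
                table[b][s] = (key, value)
                return b, s
    raise OverflowError("all buckets are full")


def lookup(table, key):
    """Search in the same order insert uses; assumes records are never deleted."""
    home = key % NUM_BUCKETS
    for step in range(NUM_BUCKETS):
        b = (home + step) % NUM_BUCKETS
        for s in range(BUCKET_SIZE):
            entry = table[b][s]
            if entry is None:
                return None          # reached an empty slot: key is absent
            if entry[0] == key:
                return entry[1]
    return None
```

The point of the bucket is that up to five keys can share a hash address before anything has to spill elsewhere, so a collision only becomes a real problem once the home bucket is full.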

Can anyone maybe explain this in a nice, easy-to-understand way?

Thanks in advance
S1ght

4cer · Oct 23, 2008

go clubbing... find a girl.. get laid... = no problem with "hash tables"

S1ght · Oct 23, 2008

Should I post links to all the times you asked for help with homework?

fxit_man · Oct 23, 2008

Pyro · Oct 23, 2008

If you're writing to a record that's already taken, it keeps looking forward until it finds an unused block, and uses that.

Or am I understanding your question wrong?
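The "keep looking forward" approach is usually called linear probing. A minimal Python sketch of it, assuming 13 slots, integer keys and a key-modulo-13 hash (none of which come from the thread), could be:

```python
NUM_SLOTS = 13   # assumed table size


def make_slots():
    """None marks an unused slot."""
    return [None] * NUM_SLOTS


def insert(slots, key):
    """Place key at its home slot, or the next unused slot after it."""
    home = key % NUM_SLOTS
    for step in range(NUM_SLOTS):
        pos = (home + step) % NUM_SLOTS      # wrap around at the end
        if slots[pos] is None or slots[pos] == key:
            slots[pos] = key
            return pos
    raise OverflowError("table is full")


def find(slots, key):
    """Probe forward from the home slot; assumes keys are never deleted."""
    home = key % NUM_SLOTS
    for step in range(NUM_SLOTS):
        pos = (home + step) % NUM_SLOTS
        if slots[pos] is None:
            return None                      # empty slot ends the search
        if slots[pos] == key:
            return pos
    return None
```

Inserting keys 2, 15 and 3 in that order puts them at positions 2, 3 and 4: key 15 collides with 2 and takes slot 3, so key 3 is pushed out of its own home slot.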

xsist10 · Oct 23, 2008

I think your hash size may be your problem. Maybe increase the size of your hash to reduce collisions. MD5 returns a 16-byte hash (32 hex characters) and SHA1 a 20-byte hash (40 hex characters).
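For reference, the digest sizes can be checked with Python's hashlib, and the same sketch shows why a wide hash does not remove collisions once it is reduced to a small number of record slots. The helper name slot_for and the 13-slot table are assumptions for illustration.

```python
import hashlib


def slot_for(key: str, num_slots: int, algorithm: str = "md5") -> int:
    """Reduce a cryptographic digest to a record slot (hypothetical helper)."""
    digest = hashlib.new(algorithm, key.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % num_slots


md5_bytes = hashlib.md5(b"").digest_size     # digest length in bytes
sha1_bytes = hashlib.sha1(b"").digest_size

# 14 distinct keys cannot all get distinct slots in a 13-slot table,
# no matter how many bits the underlying hash has.
slots = [slot_for(f"key{i}", 13) for i in range(14)]
has_collision = len(set(slots)) < len(slots)
```

So a collision strategy is still needed whenever the number of slots is small, which is the case for a 13-record file.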

S1ght · Oct 23, 2008

Pyro said:
If you're writing to a record that's already taken, it keeps looking forward until it finds an unused block, and uses that.

Or am I understanding your question wrong?

That's one method of doing it, but the problem is that a displaced record can occupy the home position of another record. Say position 2 is already taken and position 3 is still empty, so the new record goes into 3. If the next record you write was meant to go to 3, it now has to go to 4. So it's not the best method if you want most records to end up in their original positions.

FarligOpptreden · Oct 24, 2008

Wow - haven't used hash tables in ages. Haven't really had any use for them either... Will have to think about this a bit, but don't count on a decent reply

Veroland · Oct 24, 2008

I'm trying to implement it using bucket addressing with a bucket size of 5. There are 13 records in the file and I am using Visual Basic.

You can't do Buckets with VB, use a real language like c

stoke · Oct 24, 2008

You cannot expect an addressing mechanism like hashing to solve collision handling for you.

Raithlin · Oct 24, 2008

S1ght said:
... am using Visual Basic.

See now, there's your problem right there.

Veroland said:
You can't do Buckets with VB, use a real language like c

+1

Seriously, why VB? Why not a more modern language (at the very least, VB.Net)?

S1ght · Oct 25, 2008

Visual Basic is just the one subject where they made us take it, I swear.

We do C++ as our major.

Pyro · Oct 25, 2008

S1ght said:
That's one method of doing it, but the problem is that a displaced record can occupy the home position of another record. Say position 2 is already taken and position 3 is still empty, so the new record goes into 3. If the next record you write was meant to go to 3, it now has to go to 4. So it's not the best method if you want most records to end up in their original positions.

That's a shortcoming of a hash table. But a hash table was never meant to be perfect, just better than different methods. You can try other methods of collision resolution, but they'll usually have drawbacks of their own.

Since you're working with a file, you can go and physically shift everything up to the next position so that you can squeeze your new value into the optimum position, but that means anything read on the first hash takes longer to find. You gain performance on the one read, but lose it on another, and your insert is MUCH slower.

Either way, you didn't mention much on what the problem was, so it's kinda hard to solve it...

phiber · Oct 25, 2008

I'm with Pyro, what's your problem exactly? Maybe I'm just not understanding clearly. Instead of storing values, store linked lists (so if two values collide, store the second after the first and keep track of it with an index somehow). As far as I know that is one way to implement hash tables. I might just be understanding your question wrong though...
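The linked-list idea is usually called chaining. One way it might be adapted to fixed record numbers is to give every record a "next" index and to append colliding records to an overflow area after the home slots. The sketch below is in Python with an in-memory list standing in for the file; the 13 home slots, the END marker and all names are assumptions for illustration.

```python
NUM_HOME = 13    # assumed number of home slots
END = -1         # marks the end of a chain


def make_table():
    """Home slots start empty; overflow records get appended after them."""
    return [None] * NUM_HOME


def insert(records, key, value):
    """Store [key, value, next_index]; returns the record number used."""
    pos = key % NUM_HOME
    if records[pos] is None:
        records[pos] = [key, value, END]
        return pos
    while True:                              # walk the chain from the home slot
        if records[pos][0] == key:
            records[pos][1] = value          # key already stored: update it
            return pos
        if records[pos][2] == END:
            break
        pos = records[pos][2]
    records.append([key, value, END])        # new record in the overflow area
    records[pos][2] = len(records) - 1       # link it to the end of the chain
    return len(records) - 1


def lookup(records, key):
    """Follow next_index links from the home slot until the key is found."""
    pos = key % NUM_HOME
    while pos != END and records[pos] is not None:
        if records[pos][0] == key:
            return records[pos][1]
        pos = records[pos][2]
    return None
```

With keys 2, 15 and 28 all hashing to slot 2, the second and third land in overflow records 13 and 14, and a later key 3 still gets its own home slot 3. That avoids the displacement problem raised earlier in the thread, at the cost of an extra field per record and extra reads along the chain.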

Veroland · Oct 25, 2008

From what I can see he is trying to use hash indexes on buckets, but the problem seems to be trying to use a file as a DB with reads and writes without locking. Files are not a good storage medium for that, unless you are on an AS400, for example...

MielieSpoor · Oct 29, 2008

S1ght said:
Visual Basic is just the one subject where they made us take it, I swear. We do C++ as our major.

BIT @ Tuks and taking an Informatics subject maybe?

S1ght · Oct 29, 2008

BSc IT at UJ, Informatics (VB) and Computer Science (C++ and Java). Got some friends doing BIT at Tuks though.

MielieSpoor · Oct 29, 2008

When I saw the C++ and compulsory VB, I just knew there was some form of Informatics in the mix...
