What's the difference between identifying and non-identifying relationships?
Asked Answered
R

15

885

I haven't been able to fully grasp the differences. Can you describe both concepts and use real world examples?

Rotorua answered 18/4, 2009 at 5:4 Comment(1)
Good question, wheel is not be reinvented: Peter Chen. The Entity Relationship Model, Toward a Unified View of Data, 1976 § 2.3.2: "If relationships are used for identifying the entities, we shall call it a weak entity relation. If relationships are not used for identifying the entities, we shall call it a regular entity relation". The OP question boils down to: What are weak/regular entity relations?.Maneater
M
1160
  • An identifying relationship is when the existence of a row in a child table depends on a row in a parent table. This may be confusing because it's common practice these days to create a pseudokey for a child table, but not make the foreign key to the parent part of the child's primary key. Formally, the "right" way to do this is to make the foreign key part of the child's primary key. But the logical relationship is that the child cannot exist without the parent.

    Example: A Person has one or more phone numbers. If they had just one phone number, we could simply store it in a column of Person. Since we want to support multiple phone numbers, we make a second table PhoneNumbers, whose primary key includes the person_id referencing the Person table.

    We may think of the phone number(s) as belonging to a person, even though they are modeled as attributes of a separate table. This is a strong clue that this is an identifying relationship (even if we don't literally include person_id in the primary key of PhoneNumbers).

  • A non-identifying relationship is when the primary key attributes of the parent must not become primary key attributes of the child. A good example of this is a lookup table, such as a foreign key on Person.state referencing the primary key of States.state. Person is a child table with respect to States. But a row in Person is not identified by its state attribute. I.e. state is not part of the primary key of Person.

    A non-identifying relationship can be optional or mandatory, which means the foreign key column allows NULL or disallows NULL, respectively.


See also my answer to Still Confused About Identifying vs. Non-Identifying Relationships

Mota answered 18/4, 2009 at 5:59 Comment(26)
+1: Bill, "it's common practice these days to create a pseudokey for a child table, but not make the foreign key to the parent part of the child's primary key" - any links as to why this is? Google is failing me.Aldaaldan
@hobodave: It's the "convention over configuration" argument. Some schools of thought are that every table should define its primary key for a single-column pseudokey named id that auto-generates its values. Application frameworks like Rails have popularized this as a default. They treat natural keys and multi-column keys as divergent from their conventions, needed when using "legacy" databases. Many other frameworks have followed this lead.Mota
It seems like "properly" constructing identifying relationships would lead to obnoxiously huge primary keys. e.g. Building has Floor has Room has Bed. The PK for Bed would be (bed_id, floor_id, room_id, building_id). It seem's strange that I've never seen this in practice, nor heard it suggested as a way to do anything. That's a lot of redundant data in the PK.Aldaaldan
@hobodave: I have seen multi-column primary keys that are even larger. But I take your point. Consider that multi-column primary keys convey more information; you can query the Beds table for all beds in a specific building without doing any joins.Mota
@Bill Karwin If I have user table and there are fields user_id and updated_by and I would like update_by to reference the user_id, then relation must be non-identifying I think. Could you tell me am I correct in that part?Issacissachar
@Eugene, yes I would expect that to be a non-identifying relationship. user_id should be the primary key by itself, and updated_by is not part of a multi-column primary key.Mota
What if a child depends on two parents?, I suppose it will be needed a composed primary key of 3 { id, parentId1, parentId2 }.Monjan
@leoMestizo, yes, any weak entity would have an identifying relationship with its parent entity table. The phone number example would be a good case.Mota
@BillKarwin can an identifying relatonship have Mandatory/Optional??Hedelman
@IvanMatala, by definition a foreign key in an identifying relationship is part of the child table's primary key. Primary keys cannot be null, so the answer is no, an identifying relationship cannot be optional.Mota
I have read DBMS's book Elmasri navathe (and some chapters from korth also) sincerely but I didn't read this terms "identifying relationship"!! Is there any other any other terminology used for this?Banket
The example given of an identifying relationship is bad. Phone numbers change ownership over time. I would also have liked to see what the point is- why does the distinction matter?Maugre
@mheiber, yes, phone numbers change ownership, but they have one owner at any given point in time. So you can change the foreign key to another user.Mota
@BillKarwin Thanks. But how is that an identifying relationship, then? The identities of phone numbers and people don't seem to be interdependent.Maugre
@mheiber, identifying relationship just means that the foreign key in the phone numbers table is also part of that table's primary key, and therefore must be NOT NULL. The phone number can be reassigned to another user, but it must always reference one user or another.Mota
I know it's an old thread, but for the non-identifying, can the child exist without their State in your example i.e: NULL? The State is deleted, but the Person still exists?Umbel
@Atieh, yes, a non-identifying relationship allows the Person to exist without referencing any State.Mota
@BillKarwin the example of Person and PhoneNumber is not quite good. If you attach Person.id into the primary key of PhoneNumber then you will lose data integrity, as you can create a row in PhoneNumber that has a phone id for 2 people, e.g., the pair (123, 45) and (123, 46) will not violate any constraint and thats not what you want.Exhume
@Sebastian, that's not a data integrity problem, because the primary key is still unique. Think of it this way: if the PhoneNumber primary key were (person_id, sequence) and sequence starts numbering at 1 for each person, then you'd have many persons with the same value for sequence. The sequence column alone is not required to be unique, because the combination of the two columns is unique.Mota
@BillKarwin I didn't meant data integrity for the engine. Your data model will allow you to do things you don't want to.Exhume
In other words, your model allows you to have a phone number attached to more than 1 personExhume
@Sebastian, you're assuming that a given id value refers to the same phone instance, which is not necessarily true. The combination of columns in the primary key identifies the phone. In your model, you break 2nd normal form by relying on part of the key.Mota
@BillKarwin I'm not assuming. I'm using what the model could model. Your PhoneNumber table will have PK(Person.id, PhoneNumber.id). That makes a combination of 2 values to indentify a row. Again, you could have (45, 123) and (46, 123). Those values represents 2 rows as they are differents primary keys. They also references 2 different rows in the Person table. Now, you need to ensure that fields corresponding the telephone number for those rows be different.Exhume
I will never use this to model that. The best answer is from "aqsa rao" below that states the following: "An identifying relationship means that the child table cannot be uniquely identified without the parent." Because your definition is adding unecessary semantic that could confuse people. It's not the dependency between entities the reason you create an identifying or non-indentifying relationship.Exhume
@Sebastian, well, I can only suggest you study what Second Normal Form means to this case.Mota
"An identifying relationship is when the existence of a row in a child table depends on a row in a parent table." This only means that the child row existentially depends on the parent row, it does not imply an identifying relationship. An identifying relationship is a special case of existential dependence. For example, we could say that a Worker has to be associated with one and only one Project (existential dependence) but this not imply that the Worker will have the Project.ID as part of it's primary key.Virulent
K
959

There is another explanation from the real world:

A book belongs to an owner, and an owner can own multiple books. But, the book can exist also without the owner, and ownership of it can change from one owner to another. The relationship between a book and an owner is a non-identifying relationship.

A book, however, is written by an author, and the author could have written multiple books. But, the book needs to be written by an author - it cannot exist without an author. Therefore, the relationship between the book and the author is an identifying relationship.

Kutzer answered 6/3, 2010 at 12:54 Comment(8)
A decent explanation but I believe it's also instructive to extend the example a little bit. A book has a number of pages. It cannot exist without pages and therefore we might conclude that the relationship between a book and the number of pages it has is also an identifying relationship. But will the number of pages attribute be part of any identification scheme (key) for the book? Probably not. The term "identifying relationship" is normally reserved for relationships involving identifying attributes - prime attributes in relational terms.Bieber
What happens if the book was written by more than 1 author? It's not identifying relationship any more as M:N type, why?Monteith
These real examples are useless. When you realize how you create tables in ER and how data integrity will hold, you then throw away these examples. If you create a strong relationship between two entities, you are forcing to create a primary key in the entity table combined with PK from the other entity. If your model allows you that the same book can have multiple authors, then it's ok to be strong. But if your model only allows you 1 author 1 book, you can't have that constraint using strong relationship because PK(Book.id, Book.person_id).Exhume
but the real usage is "can book exist without the author ?". The answer is yes in reality, because people will look for the book directly. Therefore in practice, for this case, they should be always be "non-identifying relationship".Beadledom
what is going on guys !!, This is not a valid example for the Identifying relationship !!! yes a book can't be written without an author but , the author field in the books table is NOT IDENTIFYING the book row !!!Internment
To make it technical now, the author_id should propagate as both FK and PK in "books". Assuming a book has no copies, owner_id must also propagate as FK, but not as PK. The reason why you never actually see the first identifying relationship in real world database design is we prefer to identify books by ISBN and look at them as independent. A book may be also written by more then one author, and may have multiple copies.Swat
Your example is valid if and only if the author has written only that book and the book table does not have any other field to identify it properly (like its ISBN). So this answer is not valid in a real world application. How can I prove this? Try to find Hamlet in Amazon using its author field. Why does the Romeo and Juliet tragedy appear? Maybe did Shakespeare write it too?Stralsund
This is a great example to understand the concept. You should add the relation type depends on the use case; If I was building a database for the local library Author would be a non-identifying relationship. If I was building a biography website for book authors it would probably be identifying.Covalence
L
56

Bill's answer is correct, but it is shocking to see that among all the other answers no one points out the most significant aspect.

It has been said over and over again, that in an identifying relationship the child can not exist without the parent. (e.g. user287724). This is true, but completely misses the point. It would be enough for the foreign key to be non-null to achieve this. It does not need to be part of the primary key.

So here is the real reason:

The purpose of an identifying relationship is that the foreign key should NEVER CHANGE, because it is part of the primary key... therefore identifying!!!

Lysenko answered 4/7, 2017 at 12:39 Comment(3)
+1 for "It would be enough enough for the foreign key to be non-null, to achieve this." It should be enough, but unfortunately it's not when it comes to something like Entity Framework, which doesn't work right unless you use an identifying relationship, but then the "Id" field of an entity loses it's uniqueness as a result of being just a part of a composite key.Gipon
i don't get a point in primary key -> could never changeOrphism
@Orphism Even if your database happens to support updating primary keys in some form (eg. via triggers), there are many arguments against such practice. Storage is cheap... stop mutating your database! Append-only databases benefit enormously from scalability, independent read-vs-write concurrency, auditability, and repeatable/re-playable downstream processes.Lysenko
S
24

An Identifying relationship specifies that a child object cannot exist without the parent object

Non-identifying relationships specifies a regular association between objects, 1:1 or 1:n cardinality.

Non-identifying relationships can be specified as optional where a parent is not required or mandatory where a parent is required by setting the parent table cardinality...

Snuggle answered 18/4, 2009 at 5:45 Comment(3)
This sounds more like a description of total participation in a relationship, than of an identifying relationship.Flippant
I disagree with the above definitions. You may have an object that depends on its parent and you want that object to be constrained to be linked only with 1 parent row. A House has Walls. You remove house and you don't have walls. But a wall belongs only to a house. So doing strong-relationship won't work: PK(Wall.id, House.id) will allow you to insert into the model the same wall to another house.Exhume
The reason to have the House_Wall table is to identifying a Wall inside a House. It's the identifying relationship. The House_Wall table is like PK(House.id, wall_number), FK(Wall.id). The wall_number is a sequence in a house and not unique without House.id. If you want to model like PK(Wall.id, House.id) and Wall.id must be unique, then it's enough to have House.id in the Wall table as FK. It's just trying to model different things.Reflation
Y
16

Here's a good description:

Relationships between two entities may be classified as being either "identifying" or "non-identifying". Identifying relationships exist when the primary key of the parent entity is included in the primary key of the child entity. On the other hand, a non-identifying relationship exists when the primary key of the parent entity is included in the child entity but not as part of the child entity's primary key. In addition, non-identifying relationships may be further classified as being either "mandatory" or "non-mandatory". A mandatory non-identifying relationship exists when the value in the child table cannot be null. On the other hand, a non-mandatory non-identifying relationship exists when the value in the child table can be null.

http://www.sqlteam.com/article/database-design-and-modeling-fundamentals

Here's a simple example of an identifying relationship:

Parent
------
ID (PK)
Name

Child
-----
ID (PK)
ParentID (PK, FK to Parent.ID) -- notice PK
Name

Here's a corresponding non-identifying relationship:

Parent
------
ID (PK)
Name

Child
-----
ID (PK)
ParentID (FK to Parent.ID) -- notice no PK
Name
Yorke answered 18/4, 2009 at 5:51 Comment(2)
Your answer conflicts with that given by Bill Karwin, in the difference between whether the Foreign Key "is not" or "must not" be part of the Primary Key in the Child row.Quadricycle
@Andy White But could the primary key of the parent in an identifying relationship be non-mandatory, i.e., null, when it is part of a three-column composite primary key?Conventioneer
T
16

Non-identifying relationship

A non-identifying relationship means that a child is related to parent but it can be identified on its own.

PERSON    ACCOUNT
======    =======
pk(id)    pk(id)
name      fk(person_id)
          balance

The relationship between ACCOUNT and PERSON is non-identifying.

Identifying relationship

An identifying relationship means that the parent is needed to give identity to child. The child solely exists because of parent.

This means that foreign key is a primary key too.

ITEM      LANGUAGE    ITEM_LANG
====      ========    =========
pk(id)    pk(id)      pk(fk(item_id))
name      name        pk(fk(lang_id))
                      name

The relationship between ITEM_LANG and ITEM is identifying. And between ITEM_LANG and LANGUAGE too.

Tasman answered 22/5, 2014 at 14:31 Comment(3)
How is PERSON and ACCOUNT non - identifying? How can ACCOUNT exist without PERSON?Forenoon
why there is no answer for the previous comment? @ForenoonFogle
"How is PERSON and ACCOUNT non - identifying?" Because it's modeled as such. An account can be identified without knowing person_id. Why is it modeled as such? Because an account can have a different person as its owner in its lifetime. On the other hand, an ITEM_LANG cannot have a different ITEM in its lifetime. Although you can have different combinations, thier identities are different unlike the account-person relationship.Reflation
M
15

user287724's answer gives the following example of the book and author relationship:

A book however is written by an author, and the author could have written multiple books. But the book needs to be written by an author it cannot exist without an author. Therefore the relationship between the book and the author is an identifying relationship.

This is a very confusing example and is definitely not a valid example for an identifying relationship.

Yes, a book can not be written without at least one author, but the author(it's foreign key) of the book is NOT IDENTIFYING the book in the books table!

You can remove the author (FK) from the book row and still can identify the book row by some other field (ISBN, ID, ...etc) , BUT NOT the author of the book!!

I think a valid example of an identifying relationship would be the relationship between (products table) and a (specific product details table) 1:1

products table
+------+---------------+-------+--------+
|id(PK)|Name           |type   |amount  |
+------+---------------+-------+--------+
|0     |hp-laser-510   |printer|1000    |
+------+---------------+-------+--------+
|1     |viewsonic-10   |screen |900     |
+------+---------------+-------+--------+
|2     |canon-laser-100|printer|200     |
+------+---------------+-------+--------+

printers_details table
+--------------+------------+---------+---------+------+
|Product_ID(FK)|manufacturer|cartridge|color    |papers|
+--------------+------------+---------+---------+------+
|0             |hp          |CE210    |BLACK    |300   |
+--------------+------------+---------+---------+------+
|2             |canon       |MKJ5     |COLOR    |900   |
+--------------+------------+---------+---------+------+
* please note this is not real data

In this example the Product_ID in the printers_details table is considered a FK references the products.id table and ALSO a PK in the printers_details table , this is an identifying relationship because the Product_ID(FK) in the printers table IS IDENTIFYING the row inside the child table, we can't remove the product_id from the child table because we can't identify the row any more because we lost it's primary key

If you want to put it in 2 lines:

an identifying relationship is the relationship when the FK in the child table is considered a PK(or identifier) in the child table while still references the parent table

Another example may be when you have 3 tables (imports - products - countries) in an imports and exports for some country database

The import table is the child that has these fields(the product_id(FK), the country_id(FK) , the amount of the imports , the price , the units imported , the way of transport(air, sea) ) we may use the (product_id, thecountry_id`) to identify each row of the imports "if they all in the same year" here the both columns can compose together a primary key in the child table(imports) and also referencing there parent tables.

Please I'm happy I finally understand the concept of the identifying relationship and non identifying relationship, so please don't tell me I'm wrong with all of these vote ups for a completely invalid example

Yes logically a book can't be written without an author but a book can be identified without the author,In fact it can't be identified with the author!

You can 100% remove the author from the book row and still can identify the book!.

Moy answered 1/6, 2016 at 20:7 Comment(1)
You're right, if you only have tables books and authors. There is no identifying relationship there. But if you use a third table to represent the many-to-many relationship, the primary key of that third table consists of two foreign keys, referencing the books table and the authors table. That table has an identifying relationship to both books and authors. See my example in #2814969Mota
B
6

If you consider that the child item should be deleted when the parent is deleted, then it is an identifying relationship.

If the child item should be kept even though the parent is deleted, then it is a non-identifying relatioǹship.

As an example, I have a training database with trainees, trainings, diplomas and training sessions :

  • trainees have an identifying relationship with training sessions
  • trainings have an identifying relationship with training sessions
  • but trainees have a non-identifying relationship with diplomas

Only training sessions should be deleted if one of the related trainee, training or diploma is deleted.

Brassy answered 9/1, 2015 at 15:40 Comment(0)
Y
4

The identifying relationship means the child entity is totally dependant on the existence of the parent entity.

Example An account table person table and personAccount.The person account table is identified by the existence of account and person table only.

The non identifing relationship means the child table is not identified by the existence of the parent table.

Example A table as accountType and account.accountType table is not identified with the existence of an account table.

Yashmak answered 1/4, 2011 at 18:47 Comment(0)
Z
3

Like well explained in the link below, an identifying relation is somewhat like a weak entity type relation to its parent in the ER conceptual model. UML style CADs for data modeling do not use ER symbols or concepts, and the kind of relations are: identifying, non-identifying and non-specific.

Identifying ones are relations parent/child where the child is kind of a weak entity (even at the traditional ER model its called identifying relationship), which does not have a real primary key by its own attributes and therefore cannot be identified uniquely by its own. Every access to the child table, on the physical model, will be dependent (inclusive semantically) on the parent's primary key, which turns into part or total of the child's primary key (also being a foreign key), generally resulting in a composite key on the child side. The eventual existing keys of the child itself are only pseudo or partial-keys, not sufficient to identify any instance of that type of Entity or Entity Set, without the parent's PK.

Non-identifying relationship are the ordinary relations (partial or total), of completely independent entity sets, whose instances do not depend on each others' primary keys to be uniquely identified, although they might need foreign keys for partial or total relationships, but not as the primary key of the child. The child has its own primary key. The parent idem. Both independently. Depending on the cardinality of the relationship, the PK of one goes as a FK to the other (N side), and if partial, can be null, if total, must be not null. But, at a relationship like this, the FK will never be also the PK of the child, as when an identifying relationship is the case.

http://docwiki.embarcadero.com/ERStudioDA/XE7/en/Creating_and_Editing_Relationships

Zachariah answered 10/2, 2016 at 5:14 Comment(0)
A
3

Do attributes migrated from parent to child help identify1 the child?

  • If yes: the identification-dependence exists, the relationship is identifying and the child entity is "weak".
  • If not: the identification-dependence doesn't exists, the relationship is non-identifying and the child entity "strong".

Note that identification-dependence implies existence-dependence, but not the other way around. Every non-NULL FK means a child cannot exist without parent, but that alone doesn't make the relationship identifying.

For more on this (and some examples), take a look at the "Identifying Relationships" section of the ERwin Methods Guide.

P.S. I realize I'm (extremely) late to the party, but I feel other answers are either not entirely accurate (defining it in terms of existence-dependence instead of identification-dependence), or somewhat meandering. Hopefully this answer provides more clarity...


1 The child's FK is a part of child's PRIMARY KEY or (non-NULL) UNIQUE constraint.

Aquino answered 21/1, 2018 at 7:25 Comment(0)
A
2

A good example comes from order processing. An order from a customer typically has an Order Number that identifies the order, some data that occurs once per order such as the order date and the Customer ID, and a series of line items. Each line item contains an item number that identifies a line item within an order, a product ordered, the quantity of that product, the price of the product, and the amount for the line item, which could be computed by multiplying the quantity by the price.

The number that identifies a line item only identifies it in the context of a single order. The first line item in every order is item number "1". The complete identity of a line item is the item number together with the order number of which it is a part.

The parent child relationship between orders and line items is therefore an identifying relationship. A closely related concept in ER modeling goes by the name "subentity", where line item is a subentity of order. Typically, a subentity has a mandatory child-parent identitying relationship to the entity that it's subordinate to.

In classical database design, the primary key of the LineItems table would be (OrderNumber, ItemNumber). Some of today's designers would give an item a separate ItemID, that serves as a primary key, and is autoincremented by the DBMS. I recommend classical design in this case.

Aground answered 19/4, 2009 at 12:7 Comment(0)
T
1

A complement to Daniel Dinnyes' answer:

On a non-identifying relationship, you can't have the same Primary Key column (let's say, "ID") twice with the same value.

However, with an identifyinig relationship, you can have the same value show up twice for the "ID" column, as long as it has a different "otherColumn_ID" Foreign Key value, because the primary key is the combination of both columns.

Note that it doesn't matter if the FK is "non-null" or not! ;-)

Trifolium answered 9/9, 2020 at 21:22 Comment(0)
D
0

An identifying relationship is between two strong entities. A non-identifying relationship may not always be a relationship between a strong entity and a weak entity. There may exist a situation where a child itself has a primary key but existence of its entity may depend on its parent entity.

For example : a relationship between a seller and a book where a book is being sold by a seller may exist where seller may have its own primary key but its entity is created only when a book is being sold

Reference based on Bill Karwin

Dibb answered 29/9, 2013 at 12:0 Comment(1)
It might help to define what you mean by a "strong" and "weak" entity here.Abisia
H
0

Let's say we have those tables:

user
--------
id
name


comments
------------
comment_id
user_id
text

relationship between those two tables will identifiying relationship. Because, comments only can be belong to its owner, not other users. for example. Each user has own comment, and when user is deleted, this user's comments also should be deleted.

Hymettus answered 6/8, 2014 at 12:47 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.