Question

Database design

You are tasked with creating a logical database design that will be used later to implement the physical database named State_Capitals. The first step of creating a logical database design is normalization.

States of the United States
Country | Region | States
United States | Midwest | Iowa (IA), Illinois (IL), Indiana (IN), Kansas (KS), Michigan (MI), Minnesota (MN), Missouri (MO), North Dakota (ND), Nebraska (NE), Ohio (OH), South Dakota (SD), Wisconsin (WI)
United States | Northeast | Connecticut (CT), Delaware (DE), Massachusetts (MA), Maryland (MD), Maine (ME), New Hampshire (NH), New Jersey (NJ), New York (NY), Pennsylvania (PA), Rhode Island (RI), Vermont (VT)
United States | South | Oklahoma (OK), Texas (TX)
United States | Southeast | Alabama (AL), Arkansas (AR), Florida (FL), Georgia (GA), Kentucky (KY), Louisiana (LA), Mississippi (MS), North Carolina (NC), South Carolina (SC), Tennessee (TN), Virginia (VA), West Virginia (WV)
United States | West | Alaska (AK), Arizona (AZ), California (CA), Colorado (CO), Hawaii (HI), Idaho (ID), Montana (MT), New Mexico (NM), Nevada (NV), Oregon (OR), Utah (UT), Washington (WA), Wyoming (WY)

Instructions for the project include the following:

  • Introduction: Explain what normalization is, and the difference between third normal form (3NF) and Boyce-Codd normal form (BCNF).
  • Body: Attached is an Excel spreadsheet that contains data of the 50 states: country, region, and states.
  • Use the normalization technique in this unit to normalize the relation.
  • Conclusion: Explain the resulting relations.

Submit your assignment as a Word document.

Homework Answers

Answer #1

Normalization:

Normalization is the process of decomposing a relation into smaller relations so that:

1. There are no insertion, update, or deletion anomalies in the tables.

2. Consistent data is easy to store, with references between the tables.

3. Data is easy to retrieve from the tables.

4. There is minimal or no data redundancy (repetition).

The database normalization process has several normal forms: 1NF, 2NF, 3NF, BCNF, 4NF, and 5NF.

********************************************************

Difference between 3rd Normal Form and BCNF:

3rd normal form: A table is in 3rd normal form if it is in 2nd normal form and no non key attribute depends transitively on the key, i.e. no non key attribute C is determined by another non key attribute B that in turn depends on the key attribute A. The relation is shown as:

A -> B -> C; thus A -> C i.e. C transitively depends on A
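As a minimal sketch of how such a dependency chain can be checked against data, the Python below tests functional dependencies on three rows taken from the question's table. The helper name fd_holds and the column names are illustrative assumptions. A sample of rows can only refute a dependency, never prove it in general.

```python
def fd_holds(rows, lhs, rhs):
    """Return True if every combination of `lhs` values maps to one `rhs` combination."""
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        value = tuple(row[a] for a in rhs)
        # setdefault stores the first value seen for this key and returns it afterwards
        if seen.setdefault(key, value) != value:
            return False
    return True


# Three rows from the question's table, one state per row.
rows = [
    {"StateCode": "IA", "Region": "Midwest", "Country": "United States"},
    {"StateCode": "IL", "Region": "Midwest", "Country": "United States"},
    {"StateCode": "TX", "Region": "South", "Country": "United States"},
]

code_to_region = fd_holds(rows, ["StateCode"], ["Region"])
region_to_country = fd_holds(rows, ["Region"], ["Country"])
code_to_country = fd_holds(rows, ["StateCode"], ["Country"])
region_to_code = fd_holds(rows, ["Region"], ["StateCode"])

print(code_to_region, region_to_country, code_to_country, region_to_code)
```

In this sample the state code determines the region and the region determines the country, so the country depends on the state code only through the region. The reverse direction fails because Midwest maps to two state codes.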

BCNF: Boyce-Codd normal form is a stricter version of 3rd NF. A table is in BCNF if it is in 3NF and, for every non-trivial functional dependency X -> Y, the determinant X is a super key. A super key is any set of attributes that contains a candidate key and therefore uniquely identifies each tuple.

Consider the relation R (A, B, C, D) with A as primary key. BCNF holds for the dependencies below, because every determinant contains A and is therefore a super key:

(A, B) -> C, D i.e. C and D can be uniquely identified by the composite of A, B

(A, C) -> B, D i.e. B and D can be uniquely identified by the composite of A, C

(A, D) -> B, C i.e. B and C can be uniquely identified by the composite of A, D

Difference between 3NF and BCNF:

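3NF removes transitive dependencies of non key attributes, but it still allows a dependency X -> A whose determinant X is not a super key, provided A is part of some candidate key. BCNF drops this exception and requires every determinant to be a super key.

Every relation in BCNF is also in 3NF, but a 3NF relation may or may not be in BCNF.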
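The gap between the two forms can be shown with a small sketch. The Python below is an illustrative checker, not a standard library API: the function names are assumptions, and the relation R(A, B, C) with AB -> C and C -> B is a hypothetical textbook-style example that is not part of the assignment data. It tests only the listed dependencies.

```python
from itertools import combinations


def closure(attrs, fds):
    """Return the closure of `attrs` under the functional dependencies `fds`."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result


def candidate_keys(relation, fds):
    """Enumerate minimal attribute sets whose closure is the whole relation."""
    keys = []
    for size in range(1, len(relation) + 1):
        for combo in combinations(sorted(relation), size):
            is_superkey = closure(combo, fds) == set(relation)
            # Skip supersets of keys already found, so only minimal keys remain.
            if is_superkey and not any(set(k) <= set(combo) for k in keys):
                keys.append(combo)
    return keys


def is_bcnf(relation, fds):
    """BCNF: the determinant of every non-trivial FD is a super key."""
    return all(
        set(rhs) <= set(lhs) or closure(lhs, fds) == set(relation)
        for lhs, rhs in fds
    )


def is_3nf(relation, fds):
    """3NF: the determinant is a super key, or every dependent attribute
    outside the determinant belongs to some candidate key."""
    prime = {a for key in candidate_keys(relation, fds) for a in key}
    for lhs, rhs in fds:
        extra = set(rhs) - set(lhs)
        if not extra or closure(lhs, fds) == set(relation):
            continue
        if not extra <= prime:
            return False
    return True


# Hypothetical relation R(A, B, C) with AB -> C and C -> B.
R = {"A", "B", "C"}
FDS = [(("A", "B"), ("C",)), (("C",), ("B",))]

in_3nf = is_3nf(R, FDS)
in_bcnf = is_bcnf(R, FDS)
print(in_3nf, in_bcnf)
```

Here C -> B has a determinant that is not a super key, which breaks BCNF, but B belongs to the candidate key (A, B), so 3NF still holds.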

********************************************************

Given data’s normal form:

The given data is in unnormalized form because the States column holds multiple values in a single record.

********************************************************

Normalizing the given data: Below are the steps to normalize the given data.

Normalizing to 1st NF:

A data table is considered to be in 1st NF if:

· It has a primary key

· Every attribute of every tuple holds a single (atomic) value.

The States column holds multiple values, so let’s break the relation as below:

CountryRegions(Country, Region)

RegionStates(Region, States)

Looking further, each entry in the States column combines a state name and its code, so let’s split the States column as below:

RegionStates(Region, StateCode, StateName)

The above relation still suffers from data redundancy, as the same Region name repeats for many states. Let’s break the tables further and add a RegionCode as below:

CountryRegions(RegionCode, Region, Country)

RegionStates(StateCode, StateName, RegionCode)

The final 1st NF tables are as below:

1. CountryRegions(RegionCode, Region, Country): RegionCode is the primary key

2. RegionStates(StateCode, StateName, RegionCode): StateCode is the primary key; RegionCode is a foreign key referencing CountryRegions
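The 1NF step can be sketched in Python. The code below splits each multi-valued States cell from the question's table into atomic rows and fills the two tables. The RegionCode values are assumed to be sequential integers in table order, since the original answer does not give them, and the helper name split_states is illustrative.

```python
import re

# The source table from the question: (Country, Region, States).
SOURCE = [
    ("United States", "Midwest",
     "Iowa (IA), Illinois (IL), Indiana (IN), Kansas (KS), Michigan (MI), "
     "Minnesota (MN), Missouri (MO), North Dakota (ND), Nebraska (NE), "
     "Ohio (OH), South Dakota (SD), Wisconsin (WI)"),
    ("United States", "Northeast",
     "Connecticut (CT), Delaware (DE), Massachusetts (MA), Maryland (MD), "
     "Maine (ME), New Hampshire (NH), New Jersey (NJ), New York (NY), "
     "Pennsylvania (PA), Rhode Island (RI), Vermont (VT)"),
    ("United States", "South", "Oklahoma (OK), Texas (TX)"),
    ("United States", "Southeast",
     "Alabama (AL), Arkansas (AR), Florida (FL), Georgia (GA), Kentucky (KY), "
     "Louisiana (LA), Mississippi (MS), North Carolina (NC), "
     "South Carolina (SC), Tennessee (TN), Virginia (VA), West Virginia (WV)"),
    ("United States", "West",
     "Alaska (AK), Arizona (AZ), California (CA), Colorado (CO), Hawaii (HI), "
     "Idaho (ID), Montana (MT), New Mexico (NM), Nevada (NV), Oregon (OR), "
     "Utah (UT), Washington (WA), Wyoming (WY)"),
]

# Matches one entry such as "New Hampshire (NH)".
STATE_PATTERN = re.compile(r"\s*(.+?)\s*\((\w{2})\)\s*")


def split_states(cell):
    """Split a multi-valued States cell into (StateCode, StateName) pairs."""
    pairs = []
    for item in cell.split(","):
        match = STATE_PATTERN.fullmatch(item)
        if match is None:
            raise ValueError(f"unrecognized state entry: {item!r}")
        pairs.append((match.group(2), match.group(1)))
    return pairs


country_regions = []  # rows of (RegionCode, Region, Country)
region_states = []    # rows of (StateCode, StateName, RegionCode)

# Assumption: RegionCode is a sequential integer starting at 1.
for region_code, (country, region, cell) in enumerate(SOURCE, start=1):
    country_regions.append((region_code, region, country))
    for state_code, state_name in split_states(cell):
        region_states.append((state_code, state_name, region_code))

print(len(country_regions), len(region_states))
```

Every row now holds one value per column, and each state code appears exactly once in region_states.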

Normalizing to 2nd NF:

A relation is considered to be in 2nd NF if:

· It is in 1st NF (already achieved in previous step)

· Every non key attribute depends on the whole key, i.e. there is no partial dependency on part of a composite key.

Consider the Functional dependencies in 1NF tables:

CountryRegions: RegionCode -> Region, Country

RegionStates: StateCode -> StateName, RegionCode

Each table has a single-attribute key, so no non key attribute can depend on only part of a key. Both tables are therefore already in 2NF.
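The 2NF test can be expressed as a short check. The sketch below looks for dependencies whose determinant is only part of a key. The function name partial_dependencies is an assumption, and the last call uses a hypothetical composite key purely for contrast.

```python
def partial_dependencies(key, fds):
    """Return the FDs whose determinant is a proper, non-empty subset of `key`
    and that determine at least one non key attribute."""
    key = set(key)
    found = []
    for lhs, rhs in fds:
        lhs_set = set(lhs)
        if lhs_set and lhs_set < key and set(rhs) - key:
            found.append((lhs, rhs))
    return found


# Functional dependencies of the two 1NF tables.
country_regions_fds = [(("RegionCode",), ("Region", "Country"))]
region_states_fds = [(("StateCode",), ("StateName", "RegionCode"))]

cr_partial = partial_dependencies(("RegionCode",), country_regions_fds)
rs_partial = partial_dependencies(("StateCode",), region_states_fds)

# Hypothetical contrast: with a composite key (StateCode, RegionCode),
# StateCode -> StateName would be a partial dependency.
contrast = partial_dependencies(
    ("StateCode", "RegionCode"),
    [(("StateCode",), ("StateName",))],
)

print(cr_partial, rs_partial, contrast)
```

A single-attribute key has no proper non-empty subset, so both real tables return an empty list.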

Normalizing to 3rd NF:

A table is considered to be in 3rd NF if:

· It is in 2NF (already achieved in previous step)

· There is no transitive dependency i.e. the non key attribute should not depend on another non key attribute.

The tables obtained so far have the transitive dependencies below:

Region -> Country: Region determines Country, so Country depends on RegionCode transitively through Region.

StateName -> RegionCode: StateName determines RegionCode, so RegionCode depends on StateCode transitively through StateName.

Let’s break the relations as below to obtain the 3rd NF:

*Bold underlined are primary keys, italic are foreign keys

Region(RegionCode, Region)

CountryRegions(Country, RegionCode)

State(StateCode, State)

RegionStates(RegionCode, StateCode)

The four tables above are the final normalized relations.

********************************************************

Logical design for the normalized tables:
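One possible way to sketch this design is as a schema, assuming SQLite through Python's standard sqlite3 module. The column types, the integer value 3 for the South region, and the choice of RegionCode and StateCode as primary keys of CountryRegions and RegionStates are assumptions rather than part of the original answer.

```python
import sqlite3

# Assumed schema for the four relations; types and key choices are illustrative.
SCHEMA = """
CREATE TABLE Region (
    RegionCode INTEGER PRIMARY KEY,
    Region     TEXT NOT NULL UNIQUE
);
CREATE TABLE CountryRegions (
    RegionCode INTEGER PRIMARY KEY REFERENCES Region(RegionCode),
    Country    TEXT NOT NULL
);
CREATE TABLE State (
    StateCode TEXT PRIMARY KEY,
    State     TEXT NOT NULL UNIQUE
);
CREATE TABLE RegionStates (
    StateCode  TEXT PRIMARY KEY REFERENCES State(StateCode),
    RegionCode INTEGER NOT NULL REFERENCES Region(RegionCode)
);
"""

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.executescript(SCHEMA)

# Load the South region from the question's table (RegionCode 3 is assumed).
conn.execute("INSERT INTO Region VALUES (3, 'South')")
conn.execute("INSERT INTO CountryRegions VALUES (3, 'United States')")
conn.executemany("INSERT INTO State VALUES (?, ?)",
                 [("OK", "Oklahoma"), ("TX", "Texas")])
conn.executemany("INSERT INTO RegionStates VALUES (?, ?)",
                 [("OK", 3), ("TX", 3)])

# Joining the four tables rebuilds the original rows, one state per row.
rows = conn.execute("""
    SELECT cr.Country, r.Region, s.State, s.StateCode
    FROM RegionStates AS rs
    JOIN State AS s ON s.StateCode = rs.StateCode
    JOIN Region AS r ON r.RegionCode = rs.RegionCode
    JOIN CountryRegions AS cr ON cr.RegionCode = r.RegionCode
    ORDER BY s.StateCode
""").fetchall()

# A state code that is missing from State is rejected by the foreign key.
try:
    conn.execute("INSERT INTO RegionStates VALUES ('ZZ', 3)")
    fk_rejected = False
except sqlite3.IntegrityError:
    fk_rejected = True

print(rows, fk_rejected)
```

The join shows that the decomposition loses no information, and the rejected insert shows the foreign keys keeping the tables consistent.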
