Question

database design you are tasked with creating a logical database design that will be used later...

database design

you are tasked with creating a logical database design that will be used later to implement the physical database named State_Capitals. The first step of creating a logical database design is normalization.

States of the United States
Country Region States
United States Midwest Iowa (IA), Illinois (IL), Indiana (IN), Kansas (KS), Michigan (MI), Minnesota (MN), Missouri (MO), North Dakota (ND), Nebraska (NE), Ohio (OH), South Dakota (SD), Wisconsin (WI)
United States Northeast Connecticut (CT), Delaware (DE), Massachusetts (MA), Maryland (MD), Maine (ME), New Hampshire (NH), New Jersey (NJ), New York (NY), Pennsylvania (PA), Rhode Island (RI), Vermont (VT)
United States South Oklahoma (OK), Texas (TX)
United States Southeast Alabama (AL), Arkansas (AR), Florida (FL), Georgia (GA), Kentucky (KY), Louisiana (LA), Mississippi (MS), North Carolina (NC), South Carolina (SC), Tennessee (TN), Virginia (VA), West Virginia (WV)
United States West Alaska (AK), Arizona (AZ), California (CA), Colorado (CO), Hawaii (HI), Idaho (ID), Montana (MT), New Mexico (NM), Nevada (NV), Oregon (OR), Utah (UT), Washington (WA), Wyoming (WY)

Instructions for the project including the following.

  • Introduction: Explain what normalization is, and the difference between third normal form (3NF) and Boyce-Codd normal form (BCNF).
  • Body: Attached is an Excel spreadsheet that contains data of the 50 states: country, region, and states.
  • Use the normalization technique in this unit to normalize the relation.
  • Conclusion: Explain the resulting relations.

Submit your assignment as a word document

Homework Answers

Answer #1

Normalization:

Normalization is a process of decomposing a relation in sub relations so that:

1. There is no insertion, update, delete anomaly in the table

2. It becomes easy to store the consistent data with reference between the tables

3. It becomes easy to retrieve the data from the tables.

4. There is minimal or no data redundancy (repetition)

There are various normal forms in the data base normalization process as: 1NF, 2NF, 3NF, BCNF, 4NF, 5NF.

********************************************************

Difference between 3rd Normal Form and BCNF:

3rd normal form: It has the condition that a table should be in 2nd normal form and there should be no transitive dependency between non key attributes such that a non key attribute C can be determined from another non key attribute B which further depends on key attribute A. The relation is shown as :

C -> B -> A; thus C -> A i.e. C transitively depends on A

BCNF: It is Boycee Codd Normal form which is an extension of 3rd NF where a table should be in 3Nf aling with the condition that all non key attributes should fully depend on super key where super key is combination of primary key and non key attribute.

Consider the relation R (A,B,C,D) with a as primary key. BCNF holds:

(A, B)-> C, D i.e. C and D can be uniquely identified by composite of A,B

(A, C)-> C,B i.e. C and B can be uniquely identified by composite of A,C

(A, D)-> C,B i.e. C and B can be uniquely identified by composite of A,D

Difference between 3NF and BCNF:

3NF deals with transitive dependency where as BCNF also deals with super key dependency.

All relations in BCNF needs to be 3NF but all 3NF relations may or may not be in BCNF

********************************************************

Given data’s normal form:

The given data is in un-normalized form as:

The States column has multiple values for same record

********************************************************

Normalising given data: Below are the steps to normalize the given data

Normalizing to 1st NF:

A data table is considered to be in 1st NF if:

· It has a primary key

· All tuples hold single value.

But state column has multiple values in it, thus let’s break the relation as below:

CountryRegions(Country, Region)

RegionStates(Region, States)

If we further observe, state is a combination of state name and its code, thus, let’s break States column as below:

RegionStates(Region, StateCode, StateName)

The above relation will still suffer with data redundancy as same Region will repeat in many states, let’s break the table further to add a RegionCode as bellow:

CountryRegions(RegionCode, Region, Country):

RegionStates(StateCode, StateName, RegionCode)

The final 1st NF tables are as below:

1. CountryRegions(RegionCode, Region, Country): RegionCode as primary key

2. RegionStates(StateCode, StateName, RegionCode) : RegionCode is reference key from CountryRegion

Normalizing to 2nd NF:

A relation is considered to be in 2nd NF if:

· It is in 1st NF (already achieved in previous step)

· The non key attributes depends on key attributes.

Consider the Functional dependencies in 1NF tables:

CounrtyRegion: RegionCode -> Region, Country

RegionStates: StateCode -> StateName, RegionCode

As non key attributes has dependency on key attributes, thus it is already in 2NF

Normalising to 3rd NF

A table is considered to be in 3rd NF if:

· It is in 2NF (already achieved in previous step)

· There is no transitive dependency i.e. the non key attribute should not depend on another non key attribute.

The obtained tables has below transitive dependencies:

Region -> Country as Region can determin Country, it is a transitive dependency

State -> RegionCode as State can determine RegionCode, it is a transitive dependency.

Let’s break the relations as below to obtain the 3rd NF:

*Bold underlined are primary keys, italic are foreign keys

Region(RegionCode, Region)

CountryRegions(Country, RegionCode)

State(StateCode, State)

RegionStates(RegionCode, StateCode)

The above 4 tables are in final normalization form.

********************************************************

Logic design for the normalized tables:

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
Write a Python  program with correct indentation that creates a dictionary containing the U.S. states as keys...
Write a Python  program with correct indentation that creates a dictionary containing the U.S. states as keys and their abbreviations as values. The program should then randomly quiz the user by displaying the abbreviation and asking the user to enter that state's name. The program should keep a count of the number of correct and incorrect responses, as well as which abbreviation the user missed. You should include the following: Mainline logic and functions Error handling Dictionaries A file is attached...
The ages for each of the fifty states in the US are shown in Table 2...
The ages for each of the fifty states in the US are shown in Table 2 Construct a frequency and a percent frequency distribution of the ages begin the first class at 20.0 and use a class width of 9. Construct a histogram. Comment on the shape of the distribution. which state has the highest age? which state has the lowest age? Table 2 Alabama 76.2 Montana 73.6 Alaska 65.9 Nebraska 78.3 Arizona 79.7 Nevada 75.1 Arkansas 70.9 New Hampshire...
For the year 2009, the table below gives the percent of people living below the poverty...
For the year 2009, the table below gives the percent of people living below the poverty line in the 26 states east of the Mississippi River. Answer the following questions based on this data. State Percent Alabama 7.5 Connecticut 7.9 Delaware 14.9 Florida 13.2 Georgia 12.1 Illinois 10.0 Indiana 9.9 Kentucky 11.9 Maine 13.3 Maryland 10.9 Massachusetts 7.9 Michigan 15.8 Mississippi 9.1 State Percent New Hampshire 14.6 New Jersey 8.3 New York 9.1 North Carolina 12.1 Ohio 13.6 Pennsylvania 10.5...
Data showing the population by state in millions of people follow (The World Almanac, 2012). The...
Data showing the population by state in millions of people follow (The World Almanac, 2012). The dataset in Excel file 2012Population.xlsx. a. Develop a frequency distribution, a percent frequency distribution, and a histogram. Use a class width of 2.5 million. b. Does there appear to be any skewness in the distribution? Explain.   c. What observations can you make about the population of the 50 states? State Population Alabama 4.8 Alaska 0.7 Arizona 6.4 Arkansas 2.9 California 37.3 Colorado 5 Connecticut...
Data on the gasoline tax per gallon (in cents) as of a certain date for the...
Data on the gasoline tax per gallon (in cents) as of a certain date for the 50 U.S. states and the District of Columbia are shown below. State Gasoline Tax per Gallon State Gasoline Tax per Gallon Alabama 20.2 Missouri 17.5 Alaska 8.0 Montana 27.1 Arizona 19.0 Nebraska 27.6 Arkansas 21.1 Nevada 33.4 California 48.9 New Hampshire 19.9 Colorado 22.0 New Jersey 14.8 Connecticut 42.9 New Mexico 18.1 Delaware 23.0 New York 44.2 District of    Columbia 23.8 North Carolina 30.3...
For the year 2009, the table below gives the percent of people living below the poverty...
For the year 2009, the table below gives the percent of people living below the poverty line in the 26 states east of the Mississippi River. Answer the following questions based on this data. State Percent Alabama 7.5 Connecticut 7.9 Delaware 14.9 Florida 13.2 Georgia 12.1 Illinois 10.0 Indiana 9.9 Kentucky 11.9 Maine 13.3 Maryland 10.9 Massachusetts 7.9 Michigan 15.8 Mississippi 9.1 State Percent New Hampshire 14.6 New Jersey 8.3 New York 9.1 North Carolina 12.1 Ohio 13.6 Pennsylvania 10.5...
Is the math SAT score correlated with verbal SAT scores? J. What would you predict the...
Is the math SAT score correlated with verbal SAT scores? J. What would you predict the math SAT score would be for a state which has a verbal SAT score of 510? STATE Mean SAT I Verbal Mean  SAT I Math New York 497 510 Connecticut 515 515 Massachusetts 518 523 New Jersey 501 514 New Hampshire 522 521 D.C. 489 476 Maine 505 501 Pennsylvania 501 502 Delaware 500 499 Georgia 494 493 Rhode Island 503 502 Virginia 515 509...
The homeownership rate in the U.S. was 64.4% in 2009. In order to determine if homeownership...
The homeownership rate in the U.S. was 64.4% in 2009. In order to determine if homeownership is linked with income, 2009 state-level data on the homeownership rate (Ownership in %) and median household income (Income in $) were collected. A portion of the data is shown in the accompanying table. State Ownership Income Alabama 70.4 37990 Alaska 64.6 59614 Arizona 65.9 43749 Arkansas 64.8 34548 California 55.0 54144 Colorado 66.0 53940 Connecticut 68.3 62861 Delaware 73.6 50124 District of Columbia...
Location Percent Obesity Median Income Alabama 33.5 49,936 Alaska 29.7 68,734 Arizona 28.9 62,283 Arkansas 35.9...
Location Percent Obesity Median Income Alabama 33.5 49,936 Alaska 29.7 68,734 Arizona 28.9 62,283 Arkansas 35.9 49,781 California 24.7 70,489 Colorado 21.3 73,034 Connecticut 26.3 72,812 Delaware 30.7 65,012 District of Columbia 21.7 85,750 Florida 26.2 54,644 Georgia 30.5 55,821 Hawaii 22.1 80,108 Idaho 28.9 58,728 Illinois 29.3 70,145 Indiana 32.7 59,892 Iowa 30.9 68,718 Kansas 31.3 63,938 Kentucky 31.6 54,555 Louisiana 34.9 49,973 Maine 28.2 58,663 Maryland 29.6 86,223 Massachusetts 23.3 86,345 Michigan 30.7 60,449 Minnesota 27.6 71,817 Mississippi...
1- The founder and president of this bank were delegates to the Continental Congress that issued...
1- The founder and president of this bank were delegates to the Continental Congress that issued the Declaration of Independence: a. The Bank of North America. b. The Second Bank of the United States. c. The Third Bank of the United States. d. Wells Fargo Bank. e. The Bank of England. 2- According to Trask, what region of the country continued to conduct trade with Great Britain during the War of 1812? a. The South. b. The Middle-Atlantic states. c....