Flashcards

Question 1

Q

Aggregation:

Answer

A

Used to model a relationship involving a relationship set

Allows us to treat a relationship set as an entity set for purposes of participation in other relationships

Question 2

Q

Lowest-level ER model:

Answer

A

physical data model (PDM) – most detailed

Question 3

Q

Is ER design subjective?

Question 4

Q

Entity:

Answer

A

A real-world object, distinguishable from other objects. An entity is described using a set of attributes.

Question 5

Q

Reasons to use ISA:

Answer

A

Add descriptive attributes specific to a subclass (ex. Not appropriate for all entities in the superclass)

Identify entities that participate in a particular relationship (ex. Not all superclass entities participate)

Question 6

Q

Weak Entity:

Answer

A

Weak entity can be identified uniquely only by considering the primary key of another (owner) entity.

Weak entity set must have total participation in this identifying relationship set.

Weak entities only have a “partial key” (dashed underline)
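
As a rough illustration of the weak-entity rules above, the sketch below maps a weak entity to a table whose primary key combines the owner's key with the partial key. The `employee` and `dependent` tables and their columns are hypothetical names chosen for the example, and SQLite is used only because it runs without a server.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled

conn.executescript("""
CREATE TABLE employee (
    emp_id INTEGER PRIMARY KEY,
    name   TEXT NOT NULL
);
-- Weak entity: dependent_name alone is only a partial key.
CREATE TABLE dependent (
    emp_id         INTEGER NOT NULL,   -- NOT NULL models total participation
    dependent_name TEXT    NOT NULL,
    age            INTEGER,
    PRIMARY KEY (emp_id, dependent_name),
    FOREIGN KEY (emp_id) REFERENCES employee(emp_id) ON DELETE CASCADE
);
""")

conn.execute("INSERT INTO employee VALUES (1, 'Ana'), (2, 'Raj')")
# Two dependents share a name; they are told apart only by their owner.
conn.execute("INSERT INTO dependent VALUES (1, 'Sam', 7), (2, 'Sam', 9)")

# Deleting the owner removes its weak entities as well.
conn.execute("DELETE FROM employee WHERE emp_id = 1")
remaining = conn.execute("SELECT emp_id, dependent_name FROM dependent").fetchall()
print(remaining)
```

The `ON DELETE CASCADE` clause is one common way to express that a weak entity cannot outlive its owner; other designs may prefer to reject the delete instead.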

Question 7

Q

The steps:

Answer

A

Identify the entities
Identify the relationships between the entities
Determine whether a relationship is 1:1, 1:M, or M:N
Determine whether participation in a relationship is mandatory (at least one instance, or full participation) or not
Identify weak entities
Identify ISA hierarchies and aggregations
Consider possible refinements: should a concept be modeled as an entity or attribute? Should a concept be modeled as an entity or a relationship? Identifying relationships? Binary or ternary?

Question 8

Q

Mid-level ER model:

Answer

A

Logical data model (LDM)

Question 9

Q

In translating a relationship set to a relation, attributes of the relation must include (3):

Answer

A

keys for each participating entity set (as foreign keys)
this set of attributes forms a superkey for the relation
all descriptive attributes
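
The three requirements above can be sketched as a table definition. The example assumes a hypothetical `works_in` relationship set between `employee` and `department` with one descriptive attribute, `since`; none of these names come from the cards.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # needed for SQLite to check foreign keys

conn.executescript("""
CREATE TABLE employee   (emp_id  INTEGER PRIMARY KEY, name  TEXT);
CREATE TABLE department (dept_id INTEGER PRIMARY KEY, dname TEXT);

CREATE TABLE works_in (
    emp_id  INTEGER NOT NULL REFERENCES employee(emp_id),      -- key of participant 1
    dept_id INTEGER NOT NULL REFERENCES department(dept_id),   -- key of participant 2
    since   TEXT,                                              -- descriptive attribute
    PRIMARY KEY (emp_id, dept_id)   -- the participant keys together identify a row
);
""")

conn.execute("INSERT INTO employee VALUES (1, 'Ana')")
conn.execute("INSERT INTO department VALUES (10, 'Research')")
conn.execute("INSERT INTO works_in VALUES (1, 10, '2020-01-01')")

# A relationship row that points at a missing department is rejected.
try:
    conn.execute("INSERT INTO works_in VALUES (1, 99, '2021-01-01')")
    fk_rejected = False
except sqlite3.IntegrityError:
    fk_rejected = True
print(fk_rejected)
```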

Question 10

Q

Owner entity set and weak entity set must participate in

Answer

A

Owner entity set and weak entity set must participate in a one-to-many relationship set (one owner, many weak entities).

Question 11

Q

Entity set:

Answer

A

a collection of similar entities (ex. All employees). All entities in an entity set have the same set of attributes, each entity set has a key (underlined), and each attribute has a domain.

Question 12

Q

Locks and transaction design:

Answer

A

When you change a row, no one else can modify it until you issue a COMMIT. Try not to hold locks for too long, since doing so slows down other sessions.

Question 13

Q

Two biggest causes of contention are

Answer

A

locks and latches (or mutexes)

Question 14

Q

Types of contention (3):

Answer

A

locks
latches and mutexes
buffer contention

Question 15

Q

Two ways to design transaction locking?

Answer

A

Pessimistic and Optimistic

Question 16

Q

Contention prevents:

Answer

A

Contention prevents the database from working on all of the requests that are outstanding

Question 17

Q

High-level ER model:

Answer

A

Conceptual data model (CDM)

Question 18

Q

Contention is another word for

Answer

A

bottleneck

Question 19

Q

Pessimistic locking:

Answer

A

works best if you think someone else is going to “grab” your row before you are finished

Question 20

Q

Hints:

Answer

A

You can add a hint to your SQL to change the execution plan

SELECT /*+ INDEX(table_name index_name) */

Question 21

Q

Performance problems are often seen as

Answer

A

unacceptable response time or throughput

Question 22

Q

Are indexes good for retrieving a small or a large portion of a table?

Answer

A

Indexes are only good for retrieving a small portion of the table

Question 23

Q

Tuning involves

Answer

A

proactive monitoring and bottleneck elimination, providing room for system scalability (process more workload)

Question 24

Q

Partitioning:

Answer

A

Split tables up to make them smaller

Question 25

Q

Execution plan:

Answer

A

Called explain plan or query plan. Tells you how the optimizer will execute your SQL
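
A minimal sketch of reading a plan, assuming SQLite as a stand-in (its command is `EXPLAIN QUERY PLAN`; other databases use different commands and output formats). The `orders` table, the index name, and the `plan_for` helper are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL)")

def plan_for(conn, sql):
    """Return the human-readable detail of each plan step for the given SQL."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return [row[-1] for row in rows]  # the last column holds the description

query = "SELECT * FROM orders WHERE customer_id = 42"

plan_before = plan_for(conn, query)   # no usable index yet: full table scan
conn.execute("CREATE INDEX idx_orders_customer ON orders(customer_id)")
plan_after = plan_for(conn, query)    # the optimizer can now search the index

print(plan_before)
print(plan_after)
```

The exact wording of each step differs between SQLite versions, so the useful signal is the change from a scan to an index search rather than the literal text.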

Question 26

Q

Transaction design:

Answer

A

Minimize lock duration using pessimistic and optimistic locking strategies

Question 27

Q

Histograms:

Answer

A

Histogram allows the optimizer to understand how the data is being distributed and make the best decision

Question 28

Q

Network overhead:

Answer

A

Avoid unnecessary network round trips.

Question 29

Q

Optimistic locking:

Answer

A

works best if you think no one else will “grab” the row
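
One common way to implement optimistic locking is a version column that is checked at update time. The sketch below assumes that approach; the `account` table, the `version` column, and both helper functions are hypothetical, and the two "sessions" are simulated on a single connection.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE account (id INTEGER PRIMARY KEY, balance INTEGER, version INTEGER)")
conn.execute("INSERT INTO account VALUES (1, 100, 0)")
conn.commit()

def read_account(conn, account_id):
    """Read the row without taking a lock; remember the version that was seen."""
    return conn.execute(
        "SELECT balance, version FROM account WHERE id = ?", (account_id,)
    ).fetchone()

def optimistic_update(conn, account_id, new_balance, seen_version):
    """Apply the change only if nobody updated the row since it was read."""
    cur = conn.execute(
        "UPDATE account SET balance = ?, version = version + 1 "
        "WHERE id = ? AND version = ?",
        (new_balance, account_id, seen_version),
    )
    conn.commit()
    return cur.rowcount == 1  # 0 rows updated means the version was stale

# Two sessions read the same row, then both try to write.
balance_a, version_a = read_account(conn, 1)
balance_b, version_b = read_account(conn, 1)

first_ok = optimistic_update(conn, 1, balance_a + 50, version_a)
second_ok = optimistic_update(conn, 1, balance_b - 30, version_b)  # stale version
final_balance = read_account(conn, 1)[0]
print(first_ok, second_ok, final_balance)
```

The second writer is expected to re-read the row and retry, which is cheap when conflicts are rare and is why this strategy suits the low-conflict case described on the card.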

Question 30

Q

Denormalize:

Answer

A

merge tables to avoid joins, create “materialized views” to avoid big grouping or filtering

Question 31

Q

SQL tuning

Answer

A

optimizer determines execution plan

Question 32

Q

Does a concatenated index have one or many columns?

Answer

A

Concatenated index has more than one column
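
As a small illustration, the sketch below creates a two-column index and checks that the optimizer picks it for a query that filters on both columns. It assumes SQLite as a stand-in; the `employees` table and `idx_emp_name` index are hypothetical names.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employees (id INTEGER PRIMARY KEY, last_name TEXT, first_name TEXT, salary REAL)"
)

# Concatenated (composite) index: two columns that are queried together.
conn.execute("CREATE INDEX idx_emp_name ON employees(last_name, first_name)")

plan_rows = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT * FROM employees WHERE last_name = 'Diaz' AND first_name = 'Ana'"
).fetchall()
plan_text = " ".join(row[-1] for row in plan_rows)
print(plan_text)
```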

Question 33

Q

Two ways to reduce round trips:

Answer

A

Use the “Array” interface in your program code

Use the stored procedures for complex interactions with the database
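
The "array" idea can be sketched with Python's DB-API `executemany`, which hands the driver a whole batch of rows in one call. This is only an illustration of the calling pattern: SQLite runs in-process, so there is no network here, and whether a given client-server driver turns the batch into fewer round trips depends on that driver. The `readings` table is a hypothetical example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (sensor_id INTEGER, value REAL)")

rows = [(i % 4, i * 0.5) for i in range(1000)]

# One call with the whole array of rows, instead of 1000 separate execute() calls.
conn.executemany("INSERT INTO readings VALUES (?, ?)", rows)
conn.commit()

row_count = conn.execute("SELECT COUNT(*) FROM readings").fetchone()[0]
print(row_count)
```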

Question 34

Q

Should you create an index on all columns?

Answer

A

Do not create an index on every column

Question 35

Q

What do you violate in denormalization?

Answer

A

In denormalization you violate 3NF to improve performance

Question 36

Q

Configure Memory:

Answer

A

use memory to avoid I/O, operations to read from memory, operations to share memory

Question 37

Q

IT Service Management (ITSM) can be used to

Answer

A

IT Service Management (ITSM) can be used to identify and maintain service levels

Question 38

Q

When to not use NULLS?

Answer

A

Do not use NULL if you will search for NULLs; use a value that can be indexed instead (for example, N/A)

Question 39

Q

Reduce contention:

Answer

A

avoid contention for locks and latches

Question 40

Q

Instance tuning

Answer

A

avoid bottlenecks in initial design and later monitor performance

Question 41

Q

Indexes:

Answer

A

Create indexes on groups of columns that are queried together
Indexes speed up queries but make DML slower

Question 42

Q

Nulls:

Answer

A

Using nulls has significant performance implications (nulls do not take any space, nulls cannot be indexed)

Question 43

Q

Performance management tools do what?

Answer

A

Monitor, estimate, plan capacity, analyze, reorganize, optimize, cache, compress, sort

Question 44

Q

What can you do for optimal performance(4)?

Answer

A

Tune the application
Reduce contention
Configure Memory
Tune I/O

Question 45

Q

Tune the application:

Answer

A

choose the best data model, reduce the load on the database, tune the SQL statements

Question 46

Q

3 Things we can do to tune our application:

Answer

A

Structure the tables in a way that the database would work better
Tune the application code (Java, C++ etc.)
Tune the SQL statements

Question 47

Q

Should you put a high or low load on a database and why?

Answer

A

Do not put much load on the database

When the application puts too much load on the database and performance decreases, it is not the database's fault

Question 48

Q

When do subtypes occur?

Answer

A

Subtypes can occur when you are modelling things that are almost the same

Question 49

Q

Tune I/O:

Answer

A

use fast disks, use RAID (0+1), use SSDs

Question 50

Q

Performance management tools(7):

Answer

A

Oracle performance method – eliminating bottlenecks and developing efficient SQL statements
Database self-monitoring – sends alerts to notify of impending problem using expected values for comparison
AWR (Automatic Workload Repository) – performance history
AWR baseline – statistics with DB performing well at peak load
Adaptive thresholds – warning and critical alert thresholds
ADDM uses AWR statistics to diagnose performance
OEM (Oracle Enterprise Manager) – GUI for maintenance

Question 51

Q

Why start in 3NF?

Answer

A

The starting point is 3NF: it removes redundancy from the data and determines the PKs. Every column in a table should be identified by the PK, all of the PK, and nothing but the PK.

Question 52

Q

usually locking problems are due to :

Answer

A

usually locking problems are due to application locks, sometimes internal locks can cause problems

Question 53

Q

Database buffers cache table and index data to avoid what?

Answer

A

Database buffers cache table and index data to avoid reading data from the disk

Question 54

Q

What do locks prevent?

Answer

A

Prevent 2 sessions from changing table data at the same time – this avoids “lost updates”

Question 55

Q

Database might have to perform a sort if:

Answer

A

SQL contains an ORDER BY or GROUP BY
Two tables are being joined without an index: both tables are sorted and the results merged (called a SORT-MERGE join)
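
A minimal sketch of the sort-merge idea on in-memory rows, assuming each row is a tuple and the join columns are given by position. The function name and the sample data are hypothetical; a real database performs the same steps on disk-backed data.

```python
def sort_merge_join(left, right, key_left, key_right):
    """Join two lists of tuples on the given column positions."""
    # Step 1: sort both inputs on the join key.
    left = sorted(left, key=lambda r: r[key_left])
    right = sorted(right, key=lambda r: r[key_right])

    # Step 2: walk both sorted lists in parallel and merge matching keys.
    out, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][key_left], right[j][key_right]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Emit every right row with this key for the current left row.
            k = j
            while k < len(right) and right[k][key_right] == lk:
                out.append(left[i] + right[k])
                k += 1
            i += 1
    return out

employees = [(3, "Cy"), (1, "Ana"), (2, "Raj")]
orders = [(2, "pen"), (1, "ink"), (1, "pad"), (4, "cap")]
joined = sort_merge_join(employees, orders, 0, 0)
print(joined)
```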

Question 56

Q

what are soft areas and what do they allow?

Answer

A

Soft areas (Called PGA in Oracle) allow sorting and hash structures to be maintained in memory; otherwise they would be written to a temporary

Question 57

Q

Latches and Mutex:

Answer

A

Latches are very lightweight locks that protect memory instead of tables

Question 58

Q

Buffer cache ‘hit rate’:

Answer

A

When asked for data, the database first looks for it in the memory buffers.

If the data is found in memory it is called a “hit”; otherwise the data must be read from the disk (called a “miss”)

Question 59

Q

Buffer contention:

Answer

A

Memory itself can become a problem; no one can get the memory they want.

Question 60

Q

If there is not enough memory to do the hash table or a sort in memory, then the database will…?

Answer

A

If there is not enough memory to do the hash table or a sort in memory, then the database will read and write data to a temporary file group

Question 61

Q

What do database buffers improve?

Answer

A

Database buffers improve performance by caching data in memory

Question 62

Q

Hash join

Answer

A

Hash joins are a more efficient alternative to SORT-MERGE. A hash table is built on one of the tables and acts like an “on the fly” index
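
The "on the fly" index can be sketched with a dictionary: build it from one input, then probe it with each row of the other. This is a simplified in-memory illustration; the function name and sample rows are hypothetical.

```python
from collections import defaultdict

def hash_join(build_rows, probe_rows, build_key, probe_key):
    """Join two lists of tuples using a hash table built on build_rows."""
    # Build phase: hash the (usually smaller) input on its join column.
    table = defaultdict(list)
    for row in build_rows:
        table[row[build_key]].append(row)

    # Probe phase: look up each row of the other input in the hash table.
    out = []
    for row in probe_rows:
        for match in table.get(row[probe_key], []):
            out.append(match + row)
    return out

departments = [(10, "Research"), (20, "Sales")]
employees = [("Ana", 10), ("Raj", 20), ("Cy", 30), ("Di", 10)]
joined = hash_join(departments, employees, 0, 1)
print(joined)
```

Neither input needs to be sorted, which is where the saving over a sort-merge join comes from, provided the hash table fits in memory.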

Question 63

Q

What happens when a buffer is modified?

Answer

A

When buffers are modified they are called dirty; these have to be written to disk

Question 64

Q

Hit rate ratio?

Answer

A

The ratio (hits / (hits + misses)) is called the ‘hit rate’
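
The formula can be written as a small helper. The function name and the sample counts are made up for illustration.

```python
def hit_rate(hits, misses):
    """Return hits / (hits + misses) for a buffer cache."""
    total = hits + misses
    if total == 0:
        raise ValueError("no buffer requests recorded")
    return hits / total

# 950 requests served from memory, 50 that had to go to disk.
rate = hit_rate(950, 50)
print(rate)
```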

Answer 64

A

Latches are like locks, but instead of protecting table rows, they protect memory (buffers)

Answer 65

A

Disk IO is the slowest part of the database system, so it is critical to performance

Answer 66

A

Most locking problems are caused by application code; an optimistic locking strategy is often the solution

System locks can also cause problems, but these are rare and database specific

Answer 67

A

Latency is the time taken to perform a single IO

Answer 68

A

When all the blocks are dirty then sessions have to wait for the buffers to be written before new data can be read

Answer 69

A

Throughput is the number of operations over time (IO/second)

Answer 70

A

for a latch:
If two sessions try to access the same area of memory, then one will wait

Instead of “sleeping” (like a lock) the waiting session will “spin” on the CPU for a very short time

Answer 71

A

High latency means you are overloading the disk

Answer 72

A

throughput, disk fill

Answer 73

A

To avoid overloading disks, we combine multiple disks into an “array”. The array can then support higher amounts of disk IO

Answer 74

A

sparsely populated, under only moderate load

Answer 75

A

The array can also protect against data loss by storing multiple copies of data

Answer 76

A

When the disk is overloaded, latency goes up and throughput stalls (called the “hockey stick” curve)

Answer 77

A

“RAID” levels describe the type of array. RAID levels 0, 1, and 5 are the most frequently used

Answer 78

A

Distributes data across disks like RAID 0

Creates a “parity” block for every data block that can be used to recover data if the disk fails
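
Parity recovery can be sketched with XOR, assuming the usual description of RAID 5 in which one parity block is computed across the data blocks of a stripe. The block contents and the `xor_blocks` helper are hypothetical, and real arrays also rotate the parity block across disks, which is not shown.

```python
from functools import reduce

def xor_blocks(blocks):
    """XOR equally sized byte blocks together, position by position."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))

# One stripe: three data blocks, each stored on a different disk.
stripe = [b"DATA", b"BASE", b"RAID"]
parity = xor_blocks(stripe)          # stored on a fourth disk

# The disk holding the second block fails; rebuild it from the survivors.
survivors = [stripe[0], stripe[2], parity]
recovered = xor_blocks(survivors)
print(recovered)
```

The same XOR also explains the write penalty: changing any data block means the parity block has to be recomputed and rewritten as well.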

Answer 79

A

But they are a poor choice for data that does not get accessed very often

Answer 80

A

RAID 5 requires fewer disks than RAID 0+1, so it’s cheaper (but also much slower when writing)

Answer 81

A

very high IO rates

Answer 82

A

Striping and mirroring together

Best performance

Protection against data loss

More expensive (more disks) than RAID 5

Best solution for database files

Answer 83

A

Also called striping

Answer 84

A

use of information as directive rather than indicator of potential problem

Answer 85

A

Data is spread across multiple disks to distribute IO evenly

Good performance but no protection against data loss

Answer 86

A

Also called mirroring

Answer 87

A

Data is duplicated on two or more disks

Protects against data loss, but does not spread the IO across disks

Answer 88

A

Data governance program oversees the management of the quality, maintainability, availability, usability, integrity, scalability and security of enterprise data

Answer 89

A

DBMS software – migrations, procedures
Hardware configuration
Logical and physical design
Applications
Physical database structures

Answer 90

A

Impact
Prosecution
Cost
Durability

Answer 91

A

upper-level management is keenly aware of the need to comply

Answer 92

A

DBA does not request change (programmers, application owners, business owners do)

Answer 93

A

DBA carries out most database changes

Answer 94

A

Proactivity
Intelligence
Analysis
Automation
Standardization of procedure
Reliable and predictable process
Availability
Quick and efficient delivery

Answer 95

A

can result in huge fines and imprisonment

Answer 96

A

can be significant but so can the cost of non-compliance

Answer 97

A

increasing regulation – increasing time, effort and capital will be spent on compliance

Answer 98

A

Business
Legal
IT

Answer 99

A

removes the sensitive data by deleting it

Answer 100

A

must understand the legal requirements imposed on their data and systems as dictated in regulations

Answer 101

A

scrambles the data algorithmically. This technique will not produce realistic looking data and can make the data larger

Answer 102

A

Metadata management and data quality
Database and data access auditing
Data masking and obfuscation
Long-term data retention and database archiving
Closer tracking of traditional DBA tasks

Answer 103

A

varies the existing values in a specified range in order to obfuscate them

Answer 104

A

Adding columns to tables – not a good idea
DBMS traces – ISV offering is better
Log based - missing read activity
Network sniffing – missing server requests
Capture requests at the server

Answer 105

A

uses the existing data and moves the values between rows in such a way that no values are present in their original rows

Answer 106

A

must be involved to interpret the legal language of the regulations and ensure that the business is taking proper steps to protect itself

Answer 107

A

replaces existing data with random values from a pre-prepared data set

Answer 108

A

Data masking is the process of protecting sensitive and personally identifiable information (PII) in non-production databases from inappropriate visibility

Answer 109

A

must be involved to implement the policies and procedures to enact the technology to support the regulatory mandates

Answer 110

A

masks data assuring that the results are referentially intact.

Answer 111

A

Substitution
Shuffling
Number and data variance
Encryption
Nulling out
Table-to-table synchronization
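
Four of the techniques listed above can be sketched on plain Python lists. These are toy versions under simplifying assumptions: all function names and sample values are hypothetical, and encryption and table-to-table synchronization are left out because they depend on the tooling in use.

```python
import random

def substitute(values, replacements, rng):
    """Substitution: replace each value with a random pick from a prepared set."""
    return [rng.choice(replacements) for _ in values]

def shuffle_rows(values):
    """Shuffling: rotate by one so every value lands in a different row.
    (Duplicate values can still coincide with the original by chance.)"""
    return values[1:] + values[:1]

def vary_number(value, pct, rng):
    """Variance: move a number by a random amount within +/- pct of itself."""
    return value * (1 + rng.uniform(-pct, pct))

def null_out(values):
    """Nulling out: remove the sensitive values entirely."""
    return [None for _ in values]

rng = random.Random(0)  # fixed seed so the example is repeatable
names = ["Ana", "Raj", "Cy"]

masked_names = substitute(names, ["Alex", "Sam", "Kim", "Lee"], rng)
shuffled_names = shuffle_rows(names)
masked_salary = vary_number(50000, 0.10, rng)
nulled_names = null_out(names)
print(masked_names, shuffled_names, round(masked_salary), nulled_names)
```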

Answer 112

A

online analytical processing

Answer 113

A

a collection of integrated, non-volatile, time-variant, subject oriented databases designed to support the DSS function.

Answer 114

A

differs from transactional operational data in timespan, granularity and dimension

Answer 115

A

data distribution system

Answer 116

A

very large databases

Answer 117

A

Data warehouses are designed for analytical processing.

Answer 118

A

online transaction processing

Answer 119

A

Create
Operational (completing business transactions)
Reference (reporting or queries)
Archive (compliance and business protection)
Discard

Answer 120

A

The data warehouse contains atomic data and lightly summarized data.

Answer 121

A

Comprehensive, cohesive, integrated tools and processes.

Answer 122

A

Drilling up/down hierarchies
Comparing aggregate values
Parallel execution
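
Drilling up and down a hierarchy can be illustrated with `GROUP BY` at two levels of a hypothetical year/quarter hierarchy. The `sales` table and its figures are invented for the example, and SQLite is used only so the sketch runs without a server.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (year INTEGER, quarter TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [(2023, "Q1", 100), (2023, "Q2", 150), (2024, "Q1", 120), (2024, "Q2", 180)],
)

# Drilled up: one aggregate value per year, which makes years easy to compare.
by_year = conn.execute(
    "SELECT year, SUM(amount) FROM sales GROUP BY year ORDER BY year"
).fetchall()

# Drilled down: the same measure for one year, broken out by quarter.
by_quarter = conn.execute(
    "SELECT year, quarter, SUM(amount) FROM sales "
    "WHERE year = 2024 GROUP BY year, quarter ORDER BY quarter"
).fetchall()

print(by_year)
print(by_quarter)
```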

Brainscape's Knowledge Genome™
