Pune University BE(CSE) Question Papers / Previous year BE(Computer Engg.) question papers of Pune University / BE CSE Advanced Databases Question Paper / Pune University Questions with Answers
[Click on the links to view answers]
[Total No. of
Questions: 12] [Total No.
of Printed Pages : ]
UNIVERSITY
OF PUNE
[4364]-777
B.
E. (Computer Engg.)(SEM-II) Examination - 2013
Advanced
Databases (2008 Course)
[Time:
3 Hours] [Max.
Marks: 100]
Instructions:
1
Answer any three questions from each section.
2
Answers to the two sections should
be written in separate answer-books.
3
Neat diagrams must be drawn wherever necessary.
4
Black figures to the right indicate full marks.
5
Use of logarithmic tables, slide rule, Mollier charts, electronic pocket
calculator and steam tables is allowed.
6
Assume suitable data, if necessary.
SECTION
-I
B)
Write a short note on Parallel query optimization. [5]
Answer:
It is a form
of parallelism where many different Queries or Transactions are executed in
parallel with one another on many processors.
OR
B)
Explain interoperation parallelism suitable example. [6]
Q.
3 A)
If we are to ensure atomicity, all the sites in which a transaction T executed must agree on the final outcome of the execution T must either commit at all sites, or it must abort at all sites. Describe the techniques or protocol used to ensure this property in detail. [7] ALTERNATE LINK
B)
Explain semi-join strategy along with example. [6]
C)
Write short note on LDAP. [4]
OR
B)
Explain the difference between data replication in a distributed system and the
maintenance of a remote backup site. [2]
C)
What are the different approaches to store a relation in the distributed database.
Explain them in brief. [6]
D)
Write short note on multidatabase system. [3]
Q.
5 A)
Write short notes on: [8]
i) SOAP.
ii) XML DTD.
B)
Explain the structure of XML data with example. [8]
OR
Q.
6 A)
Explain the following with respect to web architecture; [8]
i) Web server.
ii) Common
gateway interface.
iii) Cookie.
iv) Uniform
Resource Locator.
B)
Which are different parsers for XML? Explain them in detail. [8]
SECTION
II
Q.
7 A)
What are different data cleaning methods? [8]
Answer:
Data cleaning
(or data cleansing) routines attempt to fill in missing values, smooth out noise while identifying outliers, and correct inconsistencies in the data.
(or data cleansing) routines attempt to fill in missing values, smooth out noise while identifying outliers, and correct inconsistencies in the data.
1. Missing values
Ignore the tuple
Fill in the missing value manually
Use a global constant to fill in the missing value
Use a measure of central tendency for the attribute (such as the mean or median) to fill in the missing value
Use the attribute mean or median for all samples belonging to the same class as the given tuple
Use the most probable value to fill in the missing value
2. Noisy data
Binning
Regression
Outlier analysis
B)
Explain architecture of data warehouse with a neat diagram. [6]
C)
A data warehouse can be modeled by either a star schema or a snowflake schema.
Briefly describe the similarities and the differences of the two models, and
then analyze their advantages and disadvantages with regard to one another.
Give your opinion of which might be more empirically useful and state the
reasons behind your answer. [4]
OR
Q.
8 A)
Explain indexing OLAP data with example [6]
B)
Explain the following operation on the multidimensional data with example. [6]
i) Roll up and drill down. ii)
Slicing & dicing
Answer:
Enterprise warehouse
- collects all of the information about subjects spanning the entire organization
Data Mart
- a subset of corporate-wide data that is of value to a specific
groups of users. Its scope is confined to specific, selected groups,
such as marketing data mart
- Independent vs. dependent (directly from warehouse) data mart
- a subset of corporate-wide data that is of value to a specific
groups of users. Its scope is confined to specific, selected groups,
such as marketing data mart
Virtual warehouse
- A set of views over operational databases
- Only some of the possible summary views may be materialized
Q.
9 A)
Consider following training set. [8]
Class Label
|
A
|
B
|
C
|
C1
|
S
|
Y
|
X
|
C1
|
B
|
Y
|
X
|
C1
|
B
|
R
|
X
|
C1
|
S
|
R
|
X
|
C2
|
S
|
B
|
X
|
C2
|
B
|
B
|
Z
|
C2
|
B
|
Y
|
Z
|
C2
|
B
|
B
|
X
|
C2
|
S
|
Y
|
Z
|
Construct decision tree based on above
training set using ID3.
B)
Explain K mean algorithm with example. Also state it weakness [8]
OR
Q.
10 A) A database has 5 transactions. Let 𝑚𝑖𝑛
𝑠𝑢𝑝
= 0.6 and 𝑚𝑖𝑛
𝑐𝑜𝑛𝑓
= 0.8. [8]
Customer
|
Date
|
Items bought
|
100
|
10/15
|
{I,P,A,D,B,C}
|
200
|
10/15
|
{D,A,E,F}
|
300
|
10/16
|
{C,D,B,E}
|
400
|
10/18
|
{B,A,C,K,D}
|
500
|
10/19
|
{A,G,T,C}
|
i) List the frequent 𝑘-itemset for
the largest 𝑘,
ii) List all the strong
association rules (with support and confidence)
B)
Explain in detail classification and prediction. What is the difference between
them. [8]
Q.
11 A) What do you mean by relevance ranking? Explain
TF/IDF methods of ranking. [8]
B)
Explain the following: [8]
i) Inverted Index
ii) Ontology
iii) Stop
Words.
iv) Random
walk
OR
Q.
12 A) What is page ranking and popularity ranking?
Explain in brief. [8]
B)
Explain the following terms [8]
i) Web
crawlers.
ii) Homonyms
iii) Vector
space model
iv) Synonyms
*************************
No comments:
Post a Comment