Saturday, 1 May 2021

Advanced Databases Pune University Question Paper Answers 2013

Pune University BE(CSE) Question Papers / Previous year BE(Computer Engg.) question papers of Pune University / BE CSE Advanced Databases Question Paper / Pune University Questions with Answers

[Click on the links to view answers]



[Total No. of Questions: 12]                                     [Total No. of Printed Pages : ]
UNIVERSITY OF PUNE
[4364]-777
B. E. (Computer Engg.)(SEM-II) Examination - 2013
Advanced Databases (2008 Course)
[Time: 3 Hours]                                                                       [Max. Marks: 100]
Instructions:
1 Answer any three questions from each section.
2 Answers to the two sections should be written in separate answer-books.
3 Neat diagrams must be drawn wherever necessary.
4 Black figures to the right indicate full marks.
5 Use of logarithmic tables, slide rule, Mollier charts, electronic pocket calculator and steam tables is allowed.
6 Assume suitable data, if necessary.

SECTION -I

B) Write a short note on Parallel query optimization. [5]
Answer:
It is a form of parallelism where many different Queries or Transactions are executed in parallel with one another on many processors.
 
OR
B) Explain interoperation parallelism suitable example. [6]

B) Explain semi-join strategy along with example. [6]
C) Write short note on LDAP. [4]
OR
B) Explain the difference between data replication in a distributed system and the maintenance of a remote backup site. [2]
C) What are the different approaches to store a relation in the distributed database. Explain them in brief. [6]
D) Write short note on multidatabase system. [3]

Q. 5 A) Write short notes on: [8]
i) SOAP.
ii) XML DTD.
B) Explain the structure of XML data with example. [8]
OR
Q. 6 A) Explain the following with respect to web architecture; [8]
i) Web server.
ii) Common gateway interface.
iii) Cookie.
iv) Uniform Resource Locator.
B) Which are different parsers for XML? Explain them in detail. [8]

SECTION II

Answer:
Data cleaning
(or data cleansing) routines attempt to fill in missing values, smooth out noise while identifying outliers, and correct inconsistencies in the data. 
1. Missing values
Ignore the tuple
Fill in the missing value manually
Use a global constant to fill in the missing value
Use a measure of central tendency for the attribute (such as the mean or median) to fill in the missing value 
Use the attribute mean or median for all samples belonging to the same class as the given tuple
Use the most probable value to fill in the missing value

2. Noisy data
Binning
Regression
Outlier analysis

B) Explain architecture of data warehouse with a neat diagram. [6]
C) A data warehouse can be modeled by either a star schema or a snowflake schema. Briefly describe the similarities and the differences of the two models, and then analyze their advantages and disadvantages with regard to one another. Give your opinion of which might be more empirically useful and state the reasons behind your answer. [4]
OR
Q. 8 A) Explain indexing OLAP data with example [6]
B) Explain the following operation on the multidimensional data with example. [6]
i) Roll up and drill down. ii) Slicing & dicing
Answer:
  • Enterprise warehouse

    • collects all of the information about subjects spanning the entire organization
  • Data Mart

    • a subset of corporate-wide data that is of value to a specific groups of users. Its scope is confined to specific, selected groups, such as marketing data mart
      • Independent vs. dependent (directly from warehouse) data mart
  • Virtual warehouse

    • A set of views over operational databases
    • Only some of the possible summary views may be materialized
 

Q. 9 A) Consider following training set. [8]
Class Label
A
B
C
C1
S
Y
X
C1
B
Y
X
C1
B
R
X
C1
S
R
X
C2
S
B
X
C2
B
B
Z
C2
B
Y
Z
C2
B
B
X
C2
S
Y
Z
Construct decision tree based on above training set using ID3.
B) Explain K mean algorithm with example. Also state it weakness [8]
OR
Q. 10 A) A database has 5 transactions. Let 𝑚𝑖𝑛 𝑠𝑢𝑝 = 0.6 and 𝑚𝑖𝑛 𝑐𝑜𝑛𝑓 = 0.8. [8]
Customer
Date
Items bought
100
10/15
{I,P,A,D,B,C}
200
10/15
{D,A,E,F}
300
10/16
{C,D,B,E}
400
10/18
{B,A,C,K,D}
500
10/19
{A,G,T,C}
i) List the frequent 𝑘-itemset for the largest 𝑘,
ii) List all the strong association rules (with support and confidence)
B) Explain in detail classification and prediction. What is the difference between them. [8]

Q. 11 A) What do you mean by relevance ranking? Explain TF/IDF methods of ranking. [8]
B) Explain the following: [8]
i) Inverted Index
ii) Ontology
iii) Stop Words.
iv) Random walk
OR
Q. 12 A) What is page ranking and popularity ranking? Explain in brief. [8]
B) Explain the following terms [8]
i) Web crawlers.
ii) Homonyms
iii) Vector space model
iv) Synonyms

*************************




No comments:

Post a Comment

Database Management Systems Anna University Exam Questions and Answers

Database management systems university question papers with answers, Anna university DBMS exam questions, Solved university exam questions f...