Εαρινό Εξάμηνο 2006. web caching. Γιάννος Γρηγορίου

Σχετικά έγγραφα
Instruction Execution Times

Assalamu `alaikum wr. wb.

Physical DB Design. B-Trees Index files can become quite large for large main files Indices on index files are possible.

ΙΑΤΜΗΜΑΤΙΚΟ ΠΡΟΓΡΑΜΜΑ ΜΕΤΑΠΤΥΧΙΑΚΩΝ ΣΠΟΥ ΩΝ ΣΤΑ ΠΛΗΡΟΦΟΡΙΑΚΑ ΣΥΣΤΗΜΑΤΑ "VIDEO ΚΑΤΟΠΙΝ ΖΗΤΗΣΗΣ" ΑΝΝΑ ΜΟΣΧΑ Μ 11 / 99

Web 論 文. Performance Evaluation and Renewal of Department s Official Web Site. Akira TAKAHASHI and Kenji KAMIMURA

HOMEWORK 4 = G. In order to plot the stress versus the stretch we define a normalized stretch:

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 19/5/2007

Main source: "Discrete-time systems and computer control" by Α. ΣΚΟΔΡΑΣ ΨΗΦΙΑΚΟΣ ΕΛΕΓΧΟΣ ΔΙΑΛΕΞΗ 4 ΔΙΑΦΑΝΕΙΑ 1

Στο εστιατόριο «ToDokimasesPrinToBgaleisStonKosmo?» έξω από τους δακτυλίους του Κρόνου, οι παραγγελίες γίνονται ηλεκτρονικά.

ΠΑΝΕΠΙΣΤΗΜΙΟ ΠΕΙΡΑΙΩΣ ΤΜΗΜΑ ΠΛΗΡΟΦΟΡΙΚΗΣ ΠΜΣ «ΠΡΟΗΓΜΕΝΑ ΣΥΣΤΗΜΑΤΑ ΠΛΗΡΟΦΟΡΙΚΗΣ» ΚΑΤΕΥΘΥΝΣΗ «ΕΥΦΥΕΙΣ ΤΕΧΝΟΛΟΓΙΕΣ ΕΠΙΚΟΙΝΩΝΙΑΣ ΑΝΘΡΩΠΟΥ - ΥΠΟΛΟΓΙΣΤΗ»

The Simply Typed Lambda Calculus

Block Ciphers Modes. Ramki Thurimella

Special edition of the Technical Chamber of Greece on Video Conference Services on the Internet, 2000 NUTWBCAM

Μηχανισμοί πρόβλεψης προσήμων σε προσημασμένα μοντέλα κοινωνικών δικτύων ΔΙΠΛΩΜΑΤΙΚΗ ΕΡΓΑΣΙΑ

Right Rear Door. Let's now finish the door hinge saga with the right rear door

Test Data Management in Practice

Business English. Ενότητα # 9: Financial Planning. Ευαγγελία Κουτσογιάννη Τμήμα Διοίκησης Επιχειρήσεων

EE512: Error Control Coding

ΤΜΗΜΑ ΠΛΗΡΟΦΟΡΙΚΗΣ ΠΜΣ ΕΠΙΣΤΗΜΗΣ ΥΠΟΛΟΓΙΣΤΩΝ. Διπλωματική Εργασία Μεταπτυχιακού Διπλώματος Ειδίκευσης

the total number of electrons passing through the lamp.

Phys460.nb Solution for the t-dependent Schrodinger s equation How did we find the solution? (not required)

Terabyte Technology Ltd

Math 6 SL Probability Distributions Practice Test Mark Scheme

The challenges of non-stable predicates

(C) 2010 Pearson Education, Inc. All rights reserved.

ΤΕΧΝΟΛΟΓΙΚΟ ΠΑΝΕΠΙΣΤΗΜΙΟ ΚΥΠΡΟΥ ΤΜΗΜΑ ΝΟΣΗΛΕΥΤΙΚΗΣ

A browser-based digital signing solution over the web

Homework 3 Solutions

Εισαγωγή στα Πληροφοριακά Συστήματα. Ενότητα 11: Αρχιτεκτονική Cloud

Statistical Inference I Locally most powerful tests

EPL 603 TOPICS IN SOFTWARE ENGINEERING. Lab 5: Component Adaptation Environment (COPE)

Συστήματα Διαχείρισης Βάσεων Δεδομένων

ΑΠΟΔΟΤΙΚΗ ΑΠΟΤΙΜΗΣΗ ΕΡΩΤΗΣΕΩΝ OLAP Η ΜΕΤΑΠΤΥΧΙΑΚΗ ΕΡΓΑΣΙΑ ΕΞΕΙΔΙΚΕΥΣΗΣ. Υποβάλλεται στην

Εργαστήριο Ανάπτυξης Εφαρμογών Βάσεων Δεδομένων. Εξάμηνο 7 ο

ΠΑΝΕΠΙΣΤΗΜΙΟ ΠΑΤΡΩΝ ΠΟΛΥΤΕΧΝΙΚΗ ΣΧΟΛΗ ΤΜΗΜΑ ΜΗΧΑΝΙΚΩΝ Η/Υ & ΠΛΗΡΟΦΟΡΙΚΗΣ. του Γεράσιμου Τουλιάτου ΑΜ: 697

Bayesian statistics. DS GA 1002 Probability and Statistics for Data Science.

CHAPTER 25 SOLVING EQUATIONS BY ITERATIVE METHODS

Other Test Constructions: Likelihood Ratio & Bayes Tests

SEMANTIC DATA CACHING AND REPLACEMENT

2 Composition. Invertible Mappings

Section 8.3 Trigonometric Equations

ΠΑΝΕΠΙΣΤΗΜΙΟ ΠΑΤΡΩΝ ΤΜΗΜΑ ΗΛΕΚΤΡΟΛΟΓΩΝ ΜΗΧΑΝΙΚΩΝ ΚΑΙ ΤΕΧΝΟΛΟΓΙΑΣ ΥΠΟΛΟΓΙΣΤΩΝ ΤΟΜΕΑΣ ΣΥΣΤΗΜΑΤΩΝ ΗΛΕΚΤΡΙΚΗΣ ΕΝΕΡΓΕΙΑΣ

(1) Describe the process by which mercury atoms become excited in a fluorescent tube (3)

Potential Dividers. 46 minutes. 46 marks. Page 1 of 11

A Method for Creating Shortcut Links by Considering Popularity of Contents in Structured P2P Networks

[1] P Q. Fig. 3.1

ΕΛΛΗΝΙΚΗ ΔΗΜΟΚΡΑΤΙΑ ΠΑΝΕΠΙΣΤΗΜΙΟ ΚΡΗΤΗΣ. Ψηφιακή Οικονομία. Διάλεξη 10η: Basics of Game Theory part 2 Mαρίνα Μπιτσάκη Τμήμα Επιστήμης Υπολογιστών

Strain gauge and rosettes

Πανεπιστήμιο Πειραιώς Τμήμα Πληροφορικής Πρόγραμμα Μεταπτυχιακών Σπουδών «Πληροφορική»

Approximation of distance between locations on earth given by latitude and longitude

derivation of the Laplacian from rectangular to spherical coordinates

ΕΛΛΗΝΙΚΗ ΔΗΜΟΚΡΑΤΙΑ ΠΑΝΕΠΙΣΤΗΜΙΟ ΚΡΗΤΗΣ. Ψηφιακή Οικονομία. Διάλεξη 7η: Consumer Behavior Mαρίνα Μπιτσάκη Τμήμα Επιστήμης Υπολογιστών

C.S. 430 Assignment 6, Sample Solutions

Démographie spatiale/spatial Demography

3.4 SUM AND DIFFERENCE FORMULAS. NOTE: cos(α+β) cos α + cos β cos(α-β) cos α -cos β

ίκτυο προστασίας για τα Ελληνικά αγροτικά και οικόσιτα ζώα on.net e-foundatio // itute: toring Insti SAVE-Monit

Πώς μπορεί κανείς να έχει έναν διερμηνέα κατά την επίσκεψή του στον Οικογενειακό του Γιατρό στο Ίσλινγκτον Getting an interpreter when you visit your

ΜΕΤΑΠΤΥΧΙΑΚΟ ΠΡΟΓΡΑΜΜΑ ΣΠΟΥΔΩΝ

Υλοποίηση Δικτυακών Υποδομών και Υπηρεσιών: OSPF Cost

Section 1: Listening and responding. Presenter: Niki Farfara MGTAV VCE Seminar 7 August 2016

Προσομοίωση BP με το Bizagi Modeler

Example Sheet 3 Solutions

ΚΥΠΡΙΑΚΟΣ ΣΥΝΔΕΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY 21 ος ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ Δεύτερος Γύρος - 30 Μαρτίου 2011

Econ 2110: Fall 2008 Suggested Solutions to Problem Set 8 questions or comments to Dan Fetter 1

Όλνκα πνπδάζηξηαο: Γξεγνξία αββίδνπ Α.Δ.Μ:7859. Δπηβιέπνλ Καζεγεηήο: Παζραιίδεο Αζαλάζηνο ΑΝΩΣΑΣΟ ΣΔΥΝΟΛΟΓΗΚΟ ΔΚΠΑΗΓΔΤΣΗΚΟ ΗΓΡΤΜΑ ΚΑΒΑΛΑ

5.4 The Poisson Distribution.

ΑΓΓΛΙΚΑ Ι. Ενότητα 7α: Impact of the Internet on Economic Education. Ζωή Κανταρίδου Τμήμα Εφαρμοσμένης Πληροφορικής

ΑΡΙΣΤΟΤΕΛΕΙΟ ΠΑΝΕΠΙΣΤΗΜΙΟ ΘΕΣΣΑΛΟΝΙΚΗΣ

Partial Trace and Partial Transpose

Συντακτικές λειτουργίες

CMOS Technology for Computer Architects

Finite Field Problems: Solutions

Advanced Subsidiary Unit 1: Understanding and Written Response

ΕΙΣΑΓΩΓΗ ΣΤΗ ΣΤΑΤΙΣΤΙΚΗ ΑΝΑΛΥΣΗ

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 6/5/2006

Calculating the propagation delay of coaxial cable

Second Order Partial Differential Equations

Εποχές( 1. Εποχή(του(mainframe((πολλοί( χρήστες,(ένας(υπολογιστής)(( 2. Εποχή(του(PC((ένας(χρήστης,(

ΤΙΤΛΟΣ ΠΤΥΧΙΑΚΗΣ «H ΠΙΛΟΤΙΚΗ ΕΦΑΡΜΟΓΗ ΣΥΣΤΗΜΑΤΟΣ CATERING ΣE ΚΕΝΤΡΟ ΚΑΤΑΤΑΞΗΣ ΣΤΗΝ ΚΥΠΡΟ»

Χρειάζεται να φέρω μαζί μου τα πρωτότυπα έγγραφα ή τα αντίγραφα; Asking if you need to provide the original documents or copies Ποια είναι τα κριτήρια

Γιπλυμαηική Δπγαζία. «Ανθπυποκενηπικόρ ζσεδιαζμόρ γέθςπαρ πλοίος» Φοςζιάνηρ Αθανάζιορ. Δπιβλέπυν Καθηγηηήρ: Νηθφιανο Π. Βεληίθνο

ΕΘΝΙΚΟ ΜΕΤΣΟΒΙΟ ΠΟΛΥΤΕΧΝΕΙΟ

On a four-dimensional hyperbolic manifold with finite volume

ΠΑΝΕΠΙΣΤΗΜΙΟ ΠΑΤΡΩΝ ΤΜΗΜΑ ΗΛΕΚΤΡΟΛΟΓΩΝ ΜΗΧΑΝΙΚΩΝ ΚΑΙ ΤΕΧΝΟΛΟΓΙΑΣ ΥΠΟΛΟΓΙΣΤΩΝ ΤΟΜΕΑΣ ΣΥΣΤΗΜΑΤΩΝ ΗΛΕΚΤΡΙΚΗΣ ΕΝΕΡΓΕΙΑΣ

ΣΕΝΑΡΙΟ ΑΝΑΦΟΡΑΣ ΣΕΝΑΡΙΟ ΤΩΝ ΜΗΧΑΝΙΚΩΝ ΣΕΝΑΡΙΟ ΣΥΝΕΡΓΑΣΙΑΣ

Fractional Colorings and Zykov Products of graphs

ΦΥΛΛΟ ΕΡΓΑΣΙΑΣ Α. Διαβάστε τις ειδήσεις και εν συνεχεία σημειώστε. Οπτική γωνία είδησης 1:.

«ΑΓΡΟΤΟΥΡΙΣΜΟΣ ΚΑΙ ΤΟΠΙΚΗ ΑΝΑΠΤΥΞΗ: Ο ΡΟΛΟΣ ΤΩΝ ΝΕΩΝ ΤΕΧΝΟΛΟΓΙΩΝ ΣΤΗΝ ΠΡΟΩΘΗΣΗ ΤΩΝ ΓΥΝΑΙΚΕΙΩΝ ΣΥΝΕΤΑΙΡΙΣΜΩΝ»

Galatia SIL Keyboard Information

Lecture 34 Bootstrap confidence intervals

ΠΑΝΔΠΙΣΗΜΙΟ ΜΑΚΔΓΟΝΙΑ ΠΡΟΓΡΑΜΜΑ ΜΔΣΑΠΣΤΥΙΑΚΧΝ ΠΟΤΓΧΝ ΣΜΗΜΑΣΟ ΔΦΑΡΜΟΜΔΝΗ ΠΛΗΡΟΦΟΡΙΚΗ

University of Macedonia Master in Information Systems. Networking Technologies professors: A. Economides A. Pobortsis AGREEMENT AND ACCOUNTING

Ψηφιακή ανάπτυξη. Course Unit #1 : Κατανοώντας τις βασικές σύγχρονες ψηφιακές αρχές Thematic Unit #1 : Τεχνολογίες Web και CMS

Paper Reference. Paper Reference(s) 1776/04 Edexcel GCSE Modern Greek Paper 4 Writing. Thursday 21 May 2009 Afternoon Time: 1 hour 15 minutes

Παλεπηζηήκην Πεηξαηώο Τκήκα Πιεξνθνξηθήο Πξόγξακκα Μεηαπηπρηαθώλ Σπνπδώλ «Πξνεγκέλα Σπζηήκαηα Πιεξνθνξηθήο»

DESIGN OF MACHINERY SOLUTION MANUAL h in h 4 0.

ΠΤΥΧΙΑΚΗ ΕΡΓΑΣΙΑ "ΠΟΛΥΚΡΙΤΗΡΙΑ ΣΥΣΤΗΜΑΤΑ ΛΗΨΗΣ ΑΠΟΦΑΣΕΩΝ. Η ΠΕΡΙΠΤΩΣΗ ΤΗΣ ΕΠΙΛΟΓΗΣ ΑΣΦΑΛΙΣΤΗΡΙΟΥ ΣΥΜΒΟΛΑΙΟΥ ΥΓΕΙΑΣ "

Liner Shipping Hub Network Design in a Competitive Environment

ΣΤΥΛΙΑΝΟΥ ΣΟΦΙΑ

Code Breaker. TEACHER s NOTES

Transcript:

Εαρινό Εξάμηνο 2006 web caching Γιάννος Γρηγορίου 1

Agenda Introduction-Why web caching? The Expected gains Cache Types Cache Architectures Cache Replacement policy Cache Consistency Prefetching Caching dynamic contents and multimedia streaming Conclusion and Future works 2

Introduction What is Web Caching? Introducing proxy servers at certain points in the network that serve in caching Web documents for faster client access. Comparable to the cache memory in a computer system. Why is it needed? Rapid growth in HTTP traffic (largest part of the Internet traffic) which causes more network congestion and server unavailability. The number of static web pages almost doubles every year. 3

How Web Cache Works Client Browser 1 2 7 Web Cache Web App Server Database 6 3 4 5 1. Client sends HTTP request 2. Web Cache responds immediately if cached object is available 3. If object is not in cache, Web Cache requests object from Application Server 4. Application Server generates response (may include Database queries) 5. Application Server responds to Web Cache 6. If response is cacheable, Web Cache retains a copy for subsequent requests 7. Web Cache compresses page and responds to Client

The Expected gains Bandwidth saving - it reduces the amount of bandwidth used by a client and of course this saves money Improving content availability. Improving web server availability. Reducing network latency - the request is satisfied from the cache which is closer to the client. Server load balancing. Improving user s perception about network s performance - it takes less time for a representation to be displayed. This makes the Web seem more responsive. 5

Cache Types Types of caches Client Caches - Web browsers like Internet Explorer etc. Cache stored on the local hard drive - serves one user. Proxy Caches - Placed in between (LAN network and ISP network) the client and origin server - serves many users Reverse Proxy Placed before the web server for load balancing serves the web server. 6

Terminologies Cache miss Cache hit. 7

Cache Types Cache hit Cache hit ISP Proxy server Reverse proxy router Web server Web browser cache client 8

Cache Types Cache miss Cache miss ISP Proxy server Reverse proxy router Web server Web browser cache client 9

Caching Architectures Proxy Caching Deployed at the edges of the network Unavailable cache Unavailable network Single point of failure User browser manual reconfiguration in times of failure Browser auto-reconfiguration is a recent trend client client client cache router Web (a). standalone 10

Caching Architectures Reverse Proxy Caching Placing proxies near the content provider Transparent Caching Eliminates the needs to manually configure web browsers Router-based transparent proxy caching Switch-based transparent proxy caching client client router Web client client L4 switch router Web client client cache cache (b)router-transparent (c)switch-transparent proxy caching 11

Caching Architectures A Standalone Proxy in a network. Single point of failure Bottleneck Lack of robustness Lack of scalability Solution: Cooperative caching, where caching proxies collaborate with one another in serving request. 12

Caching Architectures Hierarchical caching Caches are connected in a parentchild relationship. Tree structure where every node points to its parent Child caches are polled first Higher level caches are polled next Parent cache responsible for retrieving a fresh copy Nodes higher in the hierarchy have large user population and higher hit rates. 13

Caching Architectures Hierarchical caching Benefits: Scaleable architecture Reduces bandwidth demand on origin servers. Drawbacks: Each Hierarchy level introduces additional delays A top level server located far away from the uses may not prove to be beneficial Documents are replicated on each level Higher level caches may become bottlenecks with long queuing delays 14

Caching Architectures Co-operative (Transversal) Caching (Mesh) Mesh Topology Uses Internet cache protocol(icp) to share information, to balance loads, and to provide resistance to cache failures. When caches misses, it sends an ICP query to neighbor caches, the neighbors respond with ICP reply indicating a hit or a miss. 15

Caching Architectures Co-operative Caching (Mesh) Benefits Finds out which neighboring cache has the object Load balancing Prevents top level choking Can be configured so that neighboring caches are not queried for nearby origin servers Drawbacks ICP message overhead Replication of objects No security ( Uses UDP ) 16

Caching Architectures Hybrid A combination of Hierarchical and Mesh architecture. When caches misses, it sends a query to neighbor caches at the same level (siblings) using multicast before sending a cache request to a parent. Combines the advantages of the 2 previous architectures and minimized their disadvantages. 17

A Typical Caching Architecture WEB server memory ISP proxy server L2 cache L1 cache LAN proxy server CPU client Web browser cache 18

Caching Architectures memory WEB servers ISP proxy servers Shared cache CYTA proxy server UCY proxy server Bus EAC proxy server COOP proxy server cache cache cache cache Proc 1 Proc 2 Proc 3 Proc 4 clients 19

Web Caching Software CERN httpd is the original Web server and proxy cache. It is not used much these days. Squid is freely available, open source software that runs on most Unix platforms. Squid is likely the most widely deployed caching solution and full of many useful (and some not-so-useful) features. Microsoft has a high-end product called ISA server. The older Microsoft Proxy Server is a software product that runs on Windows NT. Cisco has an appliance called the Cache Engine. The product, which is small and modular, is designed for clustering with WCCP. Oops! is a proxy server; the main aims of its development being stable operation, service speed, main protocols support, modularity, ease at use. Oracle9iAS Web Cache combines caching, compression and assembly technologies to accelerate the delivery of both static and dynamically generated Web content. It also implements Edge Side Include (ESI) protocol advanced by Oracle, Akamai, and others. And many others 20

Cache Replacement Policy Purpose is to decide which items to evict when there is no more space in the cache. an old item to be replaced by a new one The effectiveness of a cache primarily depend on the accuracy of the replacement decisions that are taken upon the cache s saturation. 21

Cache Replacement Policy Well known cache replacement algorithms Least recently used (LRU) - evicts the item that was requested least recently. Least frequently used (LFU) - evicts the item that was least frequently used. Size - evicts the largest(max) or the smallest (MIN) object Latency Access Time (LAT) - evicts the document with the lowest download latency first. Greedy Dual Size - evicts objects with the least cost/size. LRU-MIN - minimizes the number of objects to be removed upon saturation HYB (Hybrid) an optimal combination of LAT-LFU-SIZE 22

Cache Consistency Purpose is to avoid serving an outdated cached copy of an item to the users. Items kept in cache may not be consistent with the original copy, if the original copy changes faster than the cache refresh frequency. 23

Cache Consistency Approaches to maintain cache consistency Polling-Every-Time (PET) Time-To-Live (TTL) based validation Server Invalidation 24

Cache Consistency Polling-Every-Time (PET) Clients send an If modified since request to the server on every cache hit. If the document is modified since time specified, new data is sent otherwise no data is sent. Drawback: waste the network bandwidth if the cached copy has not yet updated on the original server. 25

Cache Consistency Time-To-Live (TTL) based validation. The proxy assign a TTL value for each object Drawback: an object can expires but has not yet updated on the original server, the proxy still needs to verify with the server. Proactive Polling a TTL based validation where the proxy proactively poll the server to check the validity of the cache object either at a fixed interval or an adaptive interval. Drawback: waste the network bandwidth if the cached copy is rarely accessed after validation. To overcome this problem one may batch the validation request and responses over normal HTTP traffic. 26

Cache Consistency Server Invalidation Web Server detects an object change and sends invalidation messages to all caches that have recently accessed it Upon receiving the invalidation message the caches delete the object and do not retrieve a new copy Benefits: Strong cache consistency Reduced network transactions Improved cache utilization (by deleting stale copies) Drawbacks: Storage needed on the web server to keep track of caches CPU Overhead - search and update lists 27

Cache Consistency Weak Consistency TTL-based validation Proactive polling Strong Consistency Polling Every Time (PET) Server Invalidation 28

What to cache? Objects with long freshness time small in size They have high access latency are best suited for caching 29

What not to cache? Objects that cannot be normally cached are: Objects that are password protected Any file exceeding a predefined limit SSL requests which are tunneled Any URL with /cgi-bin/ Dynamic web contents If the origin server wants to record the exact number of times their page is viewed. they may decide not to allow their documents to be cacheable (known as cache busting) 30

Advantages / Disadvantages of web caching Advantages : Decreases network congestion Reduces access latency Reduces work load on the origin server In case of origin server crash, a client can obtain a cached copy Information distributed more widely at a lower cost Disadvantages: Stale data due to lack of proper proxy updating Access latency may increase in case of a cache miss A single proxy server is a single point of failure Proxy bottleneck 31

Prefetching Proactively preloads data from the web server into cache to facilitate near-future accesses. Prefetching policy issues: Avoid overloading the network Avoid overflowing the cache s storage space Prefetching should only apply to user request and not to requests from neighbors Prefetched document should remain in cache long enough to have a chance to be accessed When a document is removed from the cache, should also removed all the documents prefetched for that document. 32

Prefetching Prefetching methods and approaches Statistics-based approaches (user behavior) Prefetching documents near the top of the requested document 33

Prefetching Policies Mixed-access pattern Prediction makes use of aggregate access from different clients but does not explore which client makes the request Per-client access pattern Analyze access patterns on per-client basis and use the aggregate patterns for predictions Object structural information based Prefetched the hyperlinks for the accessed document. 34

Caching dynamic contents Dynamically Generated Pages Increased due to Database centric e-commerce application Frequently update contents Personalization Proxy caching is ineffective for such pages 35

Caching dynamic contents How to cache Dynamic Contents? Policy based caching Cache the generation process of the dynamic pages and not the pages them selves. Caching query results Scalable and transparent architecture possibility Caching databases contents Limited scalability and transparency Caching the application (caching applets from web servers) Partial-Page caching (split web page into templates and fragments) Combination of the above i.e. DOM Proxy : a combination of the application and query results caching. 36

Caching multimedia streaming Caching streaming objects (audio and video) Multimedia objects have distinct features such as, Large size Long duration, Intensive use of bandwidth Interactivity Properties to be study for the design of an effective caching mechanism: Streams do not require to be delivered at once. The server usually pipelines the data to the client through the network Multimedia streams are able to change their size by adjusting the quality 37

Caching multimedia streaming What we want to succeed by caching the multimedia streaming objects? Maximize quality of the delivery stream to the clients Minimize load on the server Minimize the startup latency Provide low-latency recording functionality for the clients The main challenge for a proxy caching of Internet multimedia streams is the need for congestion control. Quality Adaptation: Change the compression ratio according to available network bandwidth Problem: there is no correlation between the variation on quality of the cache stream and the required quality for the new session. 38

Caching multimedia streaming Conventional Proxy caching do not perform effectively and in some cases are inapplicable. Approaches of caching multimedia objects Partial Caching Algorithm: a small portion of the video is stored on the proxy and as a result the network resources consumed could be significantly reduced. Problems: How to optimally choose the portion to cache? How for the proxy and the origin server to jointly deliver streaming objects to clients? Streaming objects often have Variable bit rate and have high demands on transmission delays or delay jitter. Layered Encoding: compressed data is split into a base layer which contains the most essential low quality information. The other layers provides optional enhancement information. 39

Conclusion Bandwidth Web client Web Cache Web Cache 40

Conclusion Benefits of web caching and prefetching Reduce latency Improved performance of WWW Balance workload of web servers Decrease effects of network bottleneck Side effects of web caching and prefetching Deploy proxy servers may cost more Increase complexity of maintenance Caching and Prefetching themselves will consume bandwidth 41

New Era WEB server client 1. Database driven web content SIP proxy server 2. E-commerce Internet 3. Anytime and Everywhere 4. Voice over Internet-Real Time 5. Video over Internet- Real Time 6. Security sensitive content ISP proxy server GSM 3G network LAN proxy server Mobile proxy server client Web browser cache client client 42

The future for web caching Content Security Proxy and Firewall Access control Virus scanning Content filtering Handling more complex objects and real-time data Dynamic content Databases Mobile Proxy caching Content Distribution Networks( CDN) Real time engine: captures, caches and queries data at speeds greater than 12000 event/s Web Caching based on Ontology User access pattern prediction Prefetching Cache replacement algorithms 43

References 1. Web Caching by Geoff Huston, Telstra. The Internet Protocol Journal 2. A survey of Proxy Cache Evaluation Techniques, Brian D. Davison, The State University of New Jersey 3. Improving the Effectiveness of Web Caching Jean-Marc, Valerie Issarny, Michel Banatre, INRIA 4. Caching and Prefetching for Web Content Distribution Jianliang Xu, Jiangchuan Liu, Bo Li, Xiaohua Jia 5. Web Caching Prefetching as an Aspect: Towards a Dynamic- Weaving Based Solution, Marc Segura-Devillechaise, Jean-Marc Menaud, Gilles Muller, Obasco Group 6. DOMProxy: Enabling Dynamic-Content Front-end Web Caching Manolis Valiskakis, John Roussos, Panos Georgantas, Timos Sellis, National Technical University of Athens 7. Proxy Caching for Multimedia Playback Streams in the Internet, Reza Rejaie, Mark Handley, Haobo Yu, Deborah Estrin, University of Southern California 8. Offload Your Databases. A Web Caching Primer for DBAS and Architects, Howard Cohen, Jesse Anton, Oracle Corporation 44

45