Introduction to Web and HTTP Protocols in Data Communication Networks

 
CIS454/554
Data Comm. Networks

Lecture 4

Wenbing Zhao

(Part of the slides are based on Drs. Kurose & Ross's slides for their Computer Networking book)

5/13/2024
 
Administrative:
Lab report requirement:
Submit in Blackboard
Typed, must include questions/tasks, your answers, and screenshots to back up your answers
Today's topics
Web and HTTP
 
The World Wide Web

Created by Tim Berners-Lee in 1989 at CERN, the European center for nuclear physics research
Mosaic: first popular graphical browser, created by Marc Andreessen (and others), precursor to Netscape
Uses a client-server architecture
Web server
Web browser
Runs on HTTP over TCP
 
Web and HTTP

A Web page consists of objects
An object can be an HTML file, JPEG image, Java applet, audio file, ...
A Web page consists of a base HTML file that references several other objects
Each object is addressable by a URL
The idea of having one page point to another is called hypertext
Envisioned by Vannevar Bush, an MIT EE professor, in 1945
 
 
HTML – HyperText Markup Language
 
The HTML for a sample Web page
 
The formatted page
 
 
Common HTML Tags
 
URL – Uniform Resource Locator

Example URL:
http://www.someschool.edu/someDept/pic.gif
protocol: http, host name: www.someschool.edu, path name: /someDept/pic.gif

A URL encodes three types of information
What is the page called? A local path name uniquely indicating the specific page
Where is the page located? The host name of the server on which the page is located
How can the page be accessed? The protocol, e.g., http, ftp
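
As a small illustration of the three kinds of information a URL encodes, the sketch below splits a URL with Python's standard `urllib.parse.urlsplit`. The helper name `url_components` and its dictionary keys are made up for this example; only `urlsplit` and its `scheme`, `hostname`, and `path` attributes come from the standard library.

```python
from urllib.parse import urlsplit


def url_components(url):
    """Split a URL into the three pieces of information it encodes."""
    parts = urlsplit(url)
    return {
        "protocol": parts.scheme,   # how the page can be accessed
        "host": parts.hostname,     # where the page is located
        "path": parts.path,         # what the page is called
    }


components = url_components("http://www.someschool.edu/someDept/pic.gif")
print(components)
```

The same helper works for other schemes such as ftp, since the scheme is simply whatever precedes `://`.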
 
HTTP Overview

HTTP: HyperText Transfer Protocol
The Web's application-layer protocol
Client/server model
HTTP 1.0: RFC 1945
HTTP 1.1: RFC 2068 (since superseded by RFC 2616 and later revisions)

Figure: a PC running Explorer and a Mac running Navigator each send HTTP requests to, and receive HTTP responses from, a server running the Apache Web server
 
HTTP Overview

Client initiates TCP connection (creates socket) to server, port 80
Server accepts TCP connection from client
HTTP messages (application-layer protocol messages) exchanged between browser (HTTP client) and Web server (HTTP server)
TCP connection closed
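
The four steps above (connect, accept, exchange messages, close) can be sketched with plain sockets on the local machine. This is only an illustration under stated assumptions: the toy server answers a single request with a fixed body, it listens on an OS-chosen loopback port rather than port 80, and the names `serve_once`, `fetch`, and `demo` are invented for this sketch.

```python
import socket
import threading


def serve_once(listener):
    """Toy server: accept one TCP connection, answer one request, close."""
    conn, _addr = listener.accept()          # server accepts TCP connection
    with conn:
        conn.recv(4096)                      # read the request (assumed to fit in one recv)
        conn.sendall(b"HTTP/1.0 200 OK\r\nContent-Length: 5\r\n\r\nhello")
    # leaving the with-block closes the TCP connection


def fetch(host, port, path="/"):
    """Toy client: open a TCP connection, send one GET, read until the server closes."""
    with socket.create_connection((host, port), timeout=5) as sock:
        request = f"GET {path} HTTP/1.0\r\nHost: {host}\r\n\r\n"
        sock.sendall(request.encode("ascii"))
        chunks = []
        while True:
            data = sock.recv(4096)
            if not data:                     # empty read: server closed the connection
                break
            chunks.append(data)
    return b"".join(chunks)


def demo():
    listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    listener.settimeout(5)                   # avoid hanging forever if no client arrives
    listener.bind(("127.0.0.1", 0))          # port 0: let the OS pick a free port (not 80)
    listener.listen(1)
    port = listener.getsockname()[1]
    server = threading.Thread(target=serve_once, args=(listener,))
    server.start()
    response = fetch("127.0.0.1", port)
    server.join()
    listener.close()
    return response


print(demo().decode("ascii"))
```

Because the toy server closes the connection after one response, the exchange mirrors the nonpersistent behavior of HTTP/1.0.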
 
HTTP Overview

HTTP is stateless
Server maintains no information about past client requests
Protocols that maintain state are complex!
Past history (state) must be maintained
If server/client crashes, their views of state may be inconsistent and must be reconciled
 
HTTP Connections

Nonpersistent HTTP
At most one object is sent over a TCP connection
HTTP/1.0 uses nonpersistent HTTP

Persistent HTTP
Multiple objects can be sent over a single TCP connection between client and server
HTTP/1.1 uses persistent connections in default mode
 
Nonpersistent HTTP

Suppose user enters URL http://www.someSchool.edu/someDept/home.index
(contains text, references to 10 jpeg images)

1a. HTTP client initiates TCP connection to HTTP server at www.someSchool.edu on port 80
1b. HTTP server at host www.someSchool.edu, waiting for TCP connection at port 80, accepts connection, notifying client
2. HTTP client sends HTTP request message (containing URL) into TCP connection socket. Message indicates that client wants object someDept/home.index
3. HTTP server receives request message, forms response message containing requested object, and sends message into its socket
 
Nonpersistent HTTP

4. HTTP server closes TCP connection.
5. HTTP client receives response message containing HTML file, displays HTML. Parsing HTML file, finds 10 referenced jpeg objects
6. Steps 1-5 repeated for each of 10 jpeg objects
 
Non-Persistent HTTP: Response Time

Definition of RTT (Round Trip Time): time for a small packet to travel from client to server and back
 
Non-Persistent HTTP: Response Time

Response time:
one RTT to initiate TCP connection
one RTT for HTTP request and first few bytes of HTTP response to return
file transmission time
Total = 2 RTT + transmission time
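
The total above is easy to turn into a small calculator. The sketch below assumes objects are fetched one after another (no parallel connections) and that every object has the same transmission time; the function names and the sample numbers are made up for illustration.

```python
def nonpersistent_fetch_time(rtt, transmit_time):
    """Time to fetch one object: one RTT for TCP setup, one RTT for
    request/response, plus the time to transmit the file."""
    return 2 * rtt + transmit_time


def nonpersistent_page_time(rtt, transmit_time, num_referenced):
    """Base HTML file plus num_referenced objects, fetched serially,
    each over its own TCP connection."""
    return (1 + num_referenced) * nonpersistent_fetch_time(rtt, transmit_time)


# Hypothetical numbers: 100 ms RTT, 50 ms to transmit each object.
print(nonpersistent_fetch_time(0.100, 0.050))
print(nonpersistent_page_time(0.100, 0.050, 10))
```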
 
Non-Persistent HTTP Issues

Requires 2 RTTs per object
OS overhead for each TCP connection
To reduce response time, browsers often open parallel TCP connections to fetch referenced objects
 
Persistent HTTP

Server leaves connection open after sending response
Subsequent HTTP messages between same client/server are sent over the open connection
 
Persistent HTTP

Persistent without pipelining:
Client issues new request only when previous response has been received
One RTT for each referenced object
Persistent with pipelining:
Supported in HTTP/1.1 (persistent connections are the default; pipelining is optional)
Multiple requests are sent over the same connection without waiting for earlier responses. That is, after the first request, the second request is sent before the reply for the first request is received
As little as one RTT for all the referenced objects
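
To compare the connection modes side by side, the sketch below counts round trips for a page made of a base HTML file plus a number of referenced objects. It assumes zero transmission time, a single connection at a time (no parallel connections), and the best case for pipelining, where all referenced objects return within one RTT. The function name and mode labels are invented for this example.

```python
def page_load_rtts(num_objects, mode):
    """Round trips needed to fetch a base HTML file plus num_objects
    referenced objects, ignoring transmission time."""
    if mode == "nonpersistent":
        # Every object (including the base file) costs TCP setup + request.
        return 2 * (1 + num_objects)
    if mode == "persistent":
        # One TCP setup, then one RTT per request, issued one at a time.
        return 1 + 1 + num_objects
    if mode == "pipelined":
        # One TCP setup, one RTT for the base file, then all referenced
        # objects requested back to back (best case: one more RTT).
        return 1 + 1 + (1 if num_objects > 0 else 0)
    raise ValueError(f"unknown mode: {mode}")


for mode in ("nonpersistent", "persistent", "pipelined"):
    print(mode, page_load_rtts(10, mode))
```

For the 10-image page used earlier in the lecture, this gives 22, 12, and 3 RTTs respectively under the stated assumptions.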
 
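HTTP Request Message

Two types of HTTP messages: request, response
HTTP request message: ASCII (human-readable format)

Request line (GET, POST, HEAD commands):
GET /somedir/page.html HTTP/1.1
Header lines:
Host: www.someschool.edu
User-agent: Mozilla/4.0
Connection: close
Accept-language: fr
An extra carriage return, line feed indicates the end of the message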
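
Because a request is plain ASCII, it can be assembled as a string: a request line, header lines, and a blank line, each terminated by carriage return plus line feed. The sketch below is a minimal illustration; `build_request` is a made-up helper, not part of any library.

```python
def build_request(method, path, host, headers=None):
    """Assemble an HTTP/1.1 request message as ASCII bytes."""
    lines = [f"{method} {path} HTTP/1.1", f"Host: {host}"]
    for name, value in (headers or {}).items():
        lines.append(f"{name}: {value}")
    # Each line ends with CRLF; an extra CRLF marks the end of the headers.
    return ("\r\n".join(lines) + "\r\n\r\n").encode("ascii")


message = build_request(
    "GET", "/somedir/page.html", "www.someschool.edu",
    {"Connection": "close"},
)
print(message.decode("ascii"))
```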
 
HTTP Request Message: General Format

HTTP headers are plain ASCII text. This is very different from lower-layer protocols such as TCP, whose headers are binary
 
Method Types

HTTP/1.0
GET
POST
HEAD
Asks server to include only the header part in response

HTTP/1.1
GET, POST, HEAD
PUT
Uploads file in entity body to path specified in URL field
DELETE
Deletes file specified in the URL field
 
HTTP Response Message

Status line (protocol, status code, status phrase):
HTTP/1.1 200 OK
Header lines:
Connection: close
Date: Thu, 06 Aug 1998 12:00:15 GMT
Server: Apache/1.3.0 (Unix)
Last-Modified: Mon, 22 Jun 1998 …...
Content-Length: 6821
Content-Type: text/html
Data, e.g., requested HTML file:
data data data data data ...
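
A response can be taken apart the same way it is laid out: status line, header lines, a blank line, then the data. The sketch below is a simplified parser for illustration only; `parse_response` is a made-up helper, and it assumes the whole response is already in memory and that header names do not repeat.

```python
def parse_response(raw):
    """Split a raw HTTP response into status line fields, headers, and body."""
    head, _sep, body = raw.partition(b"\r\n\r\n")   # blank line ends the headers
    lines = head.decode("ascii").split("\r\n")
    version, code, phrase = lines[0].split(" ", 2)  # e.g. HTTP/1.1 200 OK
    headers = {}
    for line in lines[1:]:
        name, _colon, value = line.partition(":")
        headers[name.strip().lower()] = value.strip()
    return version, int(code), phrase, headers, body


raw = (b"HTTP/1.1 200 OK\r\n"
       b"Connection: close\r\n"
       b"Content-Type: text/html\r\n"
       b"\r\n"
       b"data data data")
print(parse_response(raw))
```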
 
HTTP Response Status Codes

Status code is in first line of the response message:
200 OK
request succeeded, requested object later in this message
301 Moved Permanently
requested object moved, new location specified later in this message (Location:)
400 Bad Request
request message not understood by server
404 Not Found
requested document not found on this server
505 HTTP Version Not Supported
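
Python's standard `http.HTTPStatus` enum knows the standard phrases for these codes, which makes for a quick lookup. The helper `describe_status` and its category labels are made up for this sketch; the grouping by first digit follows the usual 2xx/3xx/4xx/5xx convention.

```python
from http import HTTPStatus

CATEGORIES = {2: "success", 3: "redirection", 4: "client error", 5: "server error"}


def describe_status(status_line):
    """Return (standard phrase, category) for a response status line."""
    _version, code, _phrase = status_line.split(" ", 2)
    status = HTTPStatus(int(code))
    return status.phrase, CATEGORIES.get(int(code) // 100, "other")


for line in ("HTTP/1.1 200 OK",
             "HTTP/1.1 301 Moved Permanently",
             "HTTP/1.1 404 Not Found"):
    print(line, "->", describe_status(line))
```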
 
Web Caching

Goal: satisfy client request without involving origin server

User sets browser: Web accesses via proxy server
Browser sends all HTTP requests to proxy server
If object is in cache: proxy returns cached object
Else: proxy requests object from origin server, then returns object to client

Figure: two clients exchange HTTP requests and responses with a proxy server, which in turn exchanges HTTP requests and responses with two origin servers
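
The proxy's decision (return the cached object on a hit, otherwise fetch from the origin and keep a copy) can be sketched in a few lines. This is a toy model: the class `ProxyCache`, the `fetch_from_origin` callback, and the sample URL are all hypothetical, and freshness checks and eviction are left out.

```python
class ProxyCache:
    """Toy proxy cache keyed by URL."""

    def __init__(self, fetch_from_origin):
        self._fetch = fetch_from_origin   # callable: url -> object bytes
        self._store = {}
        self.hits = 0
        self.misses = 0

    def get(self, url):
        if url in self._store:            # object in cache: return cached object
            self.hits += 1
            return self._store[url]
        self.misses += 1                  # else: ask the origin server
        obj = self._fetch(url)
        self._store[url] = obj            # keep a copy for later requests
        return obj


def fake_origin(url):
    """Stand-in for an origin server, used only for this demonstration."""
    return f"contents of {url}".encode("ascii")


proxy = ProxyCache(fake_origin)
proxy.get("http://www.someschool.edu/someDept/pic.gif")
proxy.get("http://www.someschool.edu/someDept/pic.gif")
print(proxy.hits, proxy.misses)
```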
 
More about Web Caching

Proxy server acts as both client and server
Typically proxy server is installed by ISP (university, company, residential ISP)

Why Web caching?
Reduce response time for client request
Reduce traffic on an institution's access link
Internet dense with caches: enables "poor" content providers to effectively deliver content
 
Conditional GET: HTTP Built-in Support for Caching

Goal: don't send object if cache is up-to-date
Proxy server: specify date of cached copy in HTTP request
If-modified-since: <date>
Origin server: response contains no object if cached copy is up-to-date:
HTTP/1.0 304 Not Modified

Exchange between proxy server and origin server:
Object not modified: request with If-modified-since: <date>; response HTTP/1.0 304 Not Modified
Object modified: request with If-modified-since: <date>; response HTTP/1.0 200 OK <data>
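
The origin server's side of a conditional GET comes down to one date comparison. The sketch below is a simplified illustration: `conditional_get` is a made-up function, the modification times are invented, and it assumes the header value carries an explicit zone such as GMT so that the parsed date is timezone-aware.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def conditional_get(if_modified_since, last_modified, body):
    """Return (status_line, body) as an origin server might.

    if_modified_since: header value sent by the cache, or None.
    last_modified: timezone-aware datetime of the object's last change.
    """
    if if_modified_since is not None:
        cached_at = parsedate_to_datetime(if_modified_since)
        if last_modified <= cached_at:
            return "HTTP/1.0 304 Not Modified", b""   # no object in the response
    return "HTTP/1.0 200 OK", body


header = "Thu, 06 Aug 1998 12:00:15 GMT"
unchanged = datetime(1998, 6, 22, tzinfo=timezone.utc)   # older than the cached copy
changed = datetime(1998, 8, 7, tzinfo=timezone.utc)      # newer than the cached copy
print(conditional_get(header, unchanged, b"<html>...</html>"))
print(conditional_get(header, changed, b"<html>...</html>"))
```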
 
Non-Caching Example

Assumptions
Average object size = 100,000 bits
Avg. request rate from institution's browsers to origin servers = 15/sec
Delay from institutional router to any origin server and back to router = 2 sec

Figure: institutional network (10 Mbps LAN) connected by a 1.5 Mbps access link to the public Internet and the origin servers
 
Non-Caching Example

Consequences
Utilization on LAN = 15%
Utilization on access link = 100%
Total delay = Internet delay + access delay + LAN delay
= 2 sec + minutes + milliseconds
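
The two utilization figures follow from dividing the offered traffic (request rate times object size) by the link bandwidth. A minimal sketch, using the numbers from the assumptions; the function name `utilization` is made up for this example.

```python
def utilization(requests_per_sec, object_bits, link_bps):
    """Fraction of a link's capacity consumed by the offered traffic."""
    return requests_per_sec * object_bits / link_bps


REQUEST_RATE = 15            # requests per second
OBJECT_BITS = 100_000        # average object size in bits

print(utilization(REQUEST_RATE, OBJECT_BITS, 10_000_000))  # 10 Mbps LAN
print(utilization(REQUEST_RATE, OBJECT_BITS, 1_500_000))   # 1.5 Mbps access link
```

A utilization near 1 on the access link is what drives the queueing delay there up to minutes.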
 
Non-Caching Example

Possible solution
Increase bandwidth of access link to, say, 10 Mbps
Consequences
Utilization on LAN = 15%
Utilization on access link = 15%
Total delay = Internet delay + access delay + LAN delay
= 2 sec + msecs + msecs
Often a costly upgrade
 
Caching Example

Install proxy server
Suppose hit rate is 0.4
Consequence
40% of requests will be satisfied almost immediately
60% of requests are satisfied by origin server
Utilization of access link reduced to 60%, resulting in negligible delays (say 10 msec)
Total avg delay = Internet delay + access delay + LAN delay
= 0.6 * (2.01) secs + 0.4 * milliseconds < 1.4 secs

Figure: institutional network (10 Mbps LAN) with an institutional proxy server, connected by a 1.5 Mbps access link to the public Internet and the origin servers
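
The average delay is a weighted sum of the miss delay and the hit delay. The sketch below plugs in the slide's numbers and assumes, purely for illustration, that a cache hit costs 10 msec; the slide only says "milliseconds". The function name `average_delay` is made up.

```python
def average_delay(hit_rate, miss_delay_s, hit_delay_s):
    """Expected delay when a fraction hit_rate of requests is served locally."""
    return (1 - hit_rate) * miss_delay_s + hit_rate * hit_delay_s


HIT_RATE = 0.4
MISS_DELAY = 2.01    # 2 sec Internet delay + about 10 msec access delay
HIT_DELAY = 0.01     # assumed 10 msec for a hit served over the LAN

print(average_delay(HIT_RATE, MISS_DELAY, HIT_DELAY))
```

With these inputs the result is about 1.21 seconds, consistent with the bound of less than 1.4 seconds.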
 
Homework#1.3

Objective 3: Able to understand how a Uniform Resource Locator is constructed and its internal components
Problem. Imagine that someone in the CS Department at Stanford has just written a new program that he wants to distribute by FTP. He puts the program in the FTP directory ftp/pub/freebies/newprog.c. What is the URL for this program likely to be?
 
Homework#1.4

Objective 4: Able to understand the built-in support for caching in HTTP and the HTTP protocol itself
Key point: HTTP supports a conditional GET request that works with caching, so that network traffic is reduced and end-to-end latency is shortened
Problem. The If-Modified-Since header can be used to check whether a cached page is still valid. Requests can be made for pages containing images, sound, video, and so on, as well as HTML. Do you think the effectiveness of this technique is better or worse for JPEG images as compared to HTML?
 
Homework#1.5

Objective 5: Able to compute the response time as seen by a client that issued an HTTP request.
Key point: Queueing delay at the router can be a significant factor in response time
The heavier the traffic, the longer the queue at the router, which causes longer delay
Using caching, a portion of the HTTP requests will be served at the local proxy server instead of competing for resources at the router, which reduces the queue at the router and shortens the response time
Problem. Consider an institutional network connected to the Internet. Suppose that the average object size is 900,000 bits and that the average request rate from the institution's browsers to the origin servers is 1.5 requests per second. The bandwidth of the access link is 1.5 Mbps. Also suppose that the amount of time it takes from when the router on the Internet side of the access link forwards an HTTP request until it receives the response is two seconds on average. Model the total average response time as the sum of the average access delay (that is, the delay from Internet router to institution router) and the average Internet delay. For the average access delay, use $\Delta/(1-\Delta\beta)$, where $\Delta$ is the average time required to send an object over the access link and $\beta$ is the arrival rate of objects to the access link.
(1) Find the total average response time.
(2) Now suppose a cache is installed in the institutional LAN. Suppose the hit rate is 0.4. Find the total average response time.
 
Homework#1.6

Objective 6: Able to fully understand HTTP
Key points
This lecture covers two HTTP versions: 1.0 and 1.1 (HTTP/2 and HTTP/3 came later)
HTTP 1.0: TCP connection is not persistent, that is, to retrieve any object, a new TCP connection is made and then terminated
HTTP 1.1: Supports persistent TCP connections, in that the same connection is used to retrieve multiple objects if necessary
TCP uses a three-way handshake to establish a new connection, which adds a round trip delay by itself
Problem: Suppose within your Web browser you click on a link to obtain a Web page. The IP address for the associated URL has already been cached in your local host so a DNS look-up is unnecessary. Suppose that the Web page associated with the link contains some HTML text and 20 jpeg images that are hosted on the same Web server. Let RTT0 denote the RTT between the local host and the Web server. Assuming 0 transmission time of the HTML text and the embedded images, calculate how much time elapses from when the client clicks on the link until the client receives the Web page (HTML text and the embedded images) under the following scenarios:
HTTP 1.1 with pipelining (and with persistent connection)
HTTP 1.1 without pipelining (and with persistent connection)