Motivation
Topology maps of the Internet are an important tool for characterizing this critical infrastructure and understanding its properties, dynamic behavior, and evolution. They are also crucial for realistic modeling, simulation, and analysis of Internet infrastructure and other large-scale complex networks. These maps can be constructed for different layers (or granularities), e.g., fiber/copper cable, IP address, router, Points-of-Presence (PoPs), autonomous system (AS), ISP/organization. Router-level and PoP-level topology maps can powerfully inform and calibrate vulnerability assessments. ISP-level topologies, sometimes called AS-level or interdomain routing topologies (although an ISP may own multiple ASes so an AS-level graph is a slightly finer granularity) provide insights into technical, economic, policy, and security needs of the largely unregulated peering ecosystem.
CAIDA has been conducting measurements of the Internet macroscopic topology since 1998. Our tools - first skitter (1998-2008), now scamper (2007-present) have been tracking global IP level connectivity by sending probe packets from a set of source monitors to millions of geographically-distributed destinations in the IPv4 address space. Since 2003, we have been continuously probing IPv6 address space as well.
The gathered data:
- characterize macroscopic connectivity and performance of the Internet,
- allow various topological and geographical representations at multiple levels of aggregation granularity,
- provide a valuable input for empirically-based modeling of the Internet behavior and properties,
- improve situational awareness of the critical cyberinfrastructure for government agencies.
Data sources
We use two sources of data for Macroscopic Topology studies: forward Internet (IP) path information from traceroute-like active measurements and routing data from inter-domain BGP routing tables.
- Active measurement data
Co-funded by DHS and NSF, we have created a powerful and versatile distributed measurement infrastructure Archipelago (Ark) that makes use of measurement nodes located in various networks worldwide and connected via the Internet to a central server at CAIDA. Ark has pioneered new features and functionality of distributed measurement infrastructure, including flexible and efficient measurement and data collection methods. Ark topology datasets (described below) available to academic researchers and government agencies via the

