Σύνθετα Δίκτυα. com+plex: with+ -fold (having parts) Διδάζκων Δημήηριος Καηζαρός

Σχετικά έγγραφα
Fractional Colorings and Zykov Products of graphs

EE512: Error Control Coding

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 19/5/2007

Nowhere-zero flows Let be a digraph, Abelian group. A Γ-circulation in is a mapping : such that, where, and : tail in X, head in

Αλγόριθμοι και πολυπλοκότητα NP-Completeness (2)

2 Composition. Invertible Mappings

Approximation of distance between locations on earth given by latitude and longitude

Main source: "Discrete-time systems and computer control" by Α. ΣΚΟΔΡΑΣ ΨΗΦΙΑΚΟΣ ΕΛΕΓΧΟΣ ΔΙΑΛΕΞΗ 4 ΔΙΑΦΑΝΕΙΑ 1

Démographie spatiale/spatial Demography

derivation of the Laplacian from rectangular to spherical coordinates

Section 8.3 Trigonometric Equations

HOMEWORK 4 = G. In order to plot the stress versus the stretch we define a normalized stretch:

3.4 SUM AND DIFFERENCE FORMULAS. NOTE: cos(α+β) cos α + cos β cos(α-β) cos α -cos β

Phys460.nb Solution for the t-dependent Schrodinger s equation How did we find the solution? (not required)

Homework 3 Solutions

CHAPTER 25 SOLVING EQUATIONS BY ITERATIVE METHODS

The Simply Typed Lambda Calculus

[1] P Q. Fig. 3.1

Every set of first-order formulas is equivalent to an independent set

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 6/5/2006

Statistical Inference I Locally most powerful tests

ANSWERSHEET (TOPIC = DIFFERENTIAL CALCULUS) COLLECTION #2. h 0 h h 0 h h 0 ( ) g k = g 0 + g 1 + g g 2009 =?

Other Test Constructions: Likelihood Ratio & Bayes Tests

Physical DB Design. B-Trees Index files can become quite large for large main files Indices on index files are possible.

ΚΥΠΡΙΑΚΟΣ ΣΥΝΔΕΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY 21 ος ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ Δεύτερος Γύρος - 30 Μαρτίου 2011

Example Sheet 3 Solutions

Practice Exam 2. Conceptual Questions. 1. State a Basic identity and then verify it. (a) Identity: Solution: One identity is csc(θ) = 1

Lecture 2: Dirac notation and a review of linear algebra Read Sakurai chapter 1, Baym chatper 3

4.6 Autoregressive Moving Average Model ARMA(1,1)

ΠΑΝΕΠΙΣΤΗΜΙΟ ΠΕΙΡΑΙΑ ΤΜΗΜΑ ΝΑΥΤΙΛΙΑΚΩΝ ΣΠΟΥΔΩΝ ΠΡΟΓΡΑΜΜΑ ΜΕΤΑΠΤΥΧΙΑΚΩΝ ΣΠΟΥΔΩΝ ΣΤΗΝ ΝΑΥΤΙΛΙΑ

Fourier Series. MATH 211, Calculus II. J. Robert Buchanan. Spring Department of Mathematics

The challenges of non-stable predicates

ΤΕΧΝΟΛΟΓΙΚΟ ΠΑΝΕΠΙΣΤΗΜΙΟ ΚΥΠΡΟΥ ΤΜΗΜΑ ΝΟΣΗΛΕΥΤΙΚΗΣ

The Probabilistic Method - Probabilistic Techniques. Lecture 7: The Janson Inequality

C.S. 430 Assignment 6, Sample Solutions

5.4 The Poisson Distribution.

Distances in Sierpiński Triangle Graphs

ω ω ω ω ω ω+2 ω ω+2 + ω ω ω ω+2 + ω ω+1 ω ω+2 2 ω ω ω ω ω ω ω ω+1 ω ω2 ω ω2 + ω ω ω2 + ω ω ω ω2 + ω ω+1 ω ω2 + ω ω+1 + ω ω ω ω2 + ω

6.1. Dirac Equation. Hamiltonian. Dirac Eq.

Assalamu `alaikum wr. wb.

Code Breaker. TEACHER s NOTES

ΠΑΝΕΠΙΣΤΗΜΙΟ ΚΥΠΡΟΥ ΤΜΗΜΑ ΠΛΗΡΟΦΟΡΙΚΗΣ. ΕΠΛ342: Βάσεις Δεδομένων. Χειμερινό Εξάμηνο Φροντιστήριο 10 ΛΥΣΕΙΣ. Επερωτήσεις SQL

Αλγόριθμοι και πολυπλοκότητα Depth-First Search

EPL 603 TOPICS IN SOFTWARE ENGINEERING. Lab 5: Component Adaptation Environment (COPE)

Abstract Storage Devices

Block Ciphers Modes. Ramki Thurimella

ΠΑΝΕΠΙΣΤΗΜΙΟ ΠΑΤΡΩΝ ΠΟΛΥΤΕΧΝΙΚΗ ΣΧΟΛΗ ΤΜΗΜΑ ΜΗΧΑΝΙΚΩΝ Η/Υ & ΠΛΗΡΟΦΟΡΙΚΗΣ. του Γεράσιμου Τουλιάτου ΑΜ: 697

Econ 2110: Fall 2008 Suggested Solutions to Problem Set 8 questions or comments to Dan Fetter 1

SCHOOL OF MATHEMATICAL SCIENCES G11LMA Linear Mathematics Examination Solutions

On a four-dimensional hyperbolic manifold with finite volume

Second Order RLC Filters

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 24/3/2007

Concrete Mathematics Exercises from 30 September 2016

Matrices and Determinants

ΑΛΓΟΡΙΘΜΟΙ Άνοιξη I. ΜΗΛΗΣ

Μηχανισμοί πρόβλεψης προσήμων σε προσημασμένα μοντέλα κοινωνικών δικτύων ΔΙΠΛΩΜΑΤΙΚΗ ΕΡΓΑΣΙΑ

Finite Field Problems: Solutions

Ordinal Arithmetic: Addition, Multiplication, Exponentiation and Limit

Πρόβλημα 1: Αναζήτηση Ελάχιστης/Μέγιστης Τιμής

Numerical Analysis FMN011

Section 1: Listening and responding. Presenter: Niki Farfara MGTAV VCE Seminar 7 August 2016

Capacitors - Capacitance, Charge and Potential Difference

Partial Differential Equations in Biology The boundary element method. March 26, 2013

Σχέσεις, Ιδιότητες, Κλειστότητες

About these lecture notes. Simply Typed λ-calculus. Types

( ) 2 and compare to M.

ΤΜΗΜΑ ΗΛΕΚΤΡΟΛΟΓΩΝ ΜΗΧΑΝΙΚΩΝ ΚΑΙ ΜΗΧΑΝΙΚΩΝ ΥΠΟΛΟΓΙΣΤΩΝ

Chapter 6: Systems of Linear Differential. be continuous functions on the interval

Τ.Ε.Ι. ΔΥΤΙΚΗΣ ΜΑΚΕΔΟΝΙΑΣ ΠΑΡΑΡΤΗΜΑ ΚΑΣΤΟΡΙΑΣ ΤΜΗΜΑ ΔΗΜΟΣΙΩΝ ΣΧΕΣΕΩΝ & ΕΠΙΚΟΙΝΩΝΙΑΣ

9.09. # 1. Area inside the oval limaçon r = cos θ. To graph, start with θ = 0 so r = 6. Compute dr

2. THEORY OF EQUATIONS. PREVIOUS EAMCET Bits.

TMA4115 Matematikk 3

«Χρήσεις γης, αξίες γης και κυκλοφοριακές ρυθμίσεις στο Δήμο Χαλκιδέων. Η μεταξύ τους σχέση και εξέλιξη.»

b. Use the parametrization from (a) to compute the area of S a as S a ds. Be sure to substitute for ds!

Συντακτικές λειτουργίες

Solutions to the Schrodinger equation atomic orbitals. Ψ 1 s Ψ 2 s Ψ 2 px Ψ 2 py Ψ 2 pz

ΕΛΛΗΝΙΚΗ ΔΗΜΟΚΡΑΤΙΑ ΠΑΝΕΠΙΣΤΗΜΙΟ ΚΡΗΤΗΣ. Ψηφιακή Οικονομία. Διάλεξη 10η: Basics of Game Theory part 2 Mαρίνα Μπιτσάκη Τμήμα Επιστήμης Υπολογιστών

the total number of electrons passing through the lamp.

Bounding Nonsplitting Enumeration Degrees

A Hierarchy of Theta Bodies for Polynomial Systems

Potential Dividers. 46 minutes. 46 marks. Page 1 of 11

Instruction Execution Times

ΕΘΝΙΚΟ ΜΕΤΣΟΒΙΟ ΠΟΛΥΤΕΧΝΕΙΟ

ΚΥΠΡΙΑΚΗ ΕΤΑΙΡΕΙΑ ΠΛΗΡΟΦΟΡΙΚΗΣ CYPRUS COMPUTER SOCIETY ΠΑΓΚΥΠΡΙΟΣ ΜΑΘΗΤΙΚΟΣ ΔΙΑΓΩΝΙΣΜΟΣ ΠΛΗΡΟΦΟΡΙΚΗΣ 11/3/2006

Math 6 SL Probability Distributions Practice Test Mark Scheme

ΑΓΓΛΙΚΗ ΓΛΩΣΣΑ ΣΕ ΕΙΔΙΚΑ ΘΕΜΑΤΑ ΔΙΕΘΝΩΝ ΣΧΕΣΕΩΝ & ΟΙΚΟΝΟΜΙΑΣ

Section 9.2 Polar Equations and Graphs

Inverse trigonometric functions & General Solution of Trigonometric Equations

ΑΚΑ ΗΜΙΑ ΕΜΠΟΡΙΚΟΥ ΝΑΥΤΙΚΟΥ ΜΑΚΕ ΟΝΙΑΣ ΣΧΟΛΗ ΜΗΧΑΝΙΚΩΝ ΠΤΥΧΙΑΚΗ ΕΡΓΑΣΙΑ

Απόκριση σε Μοναδιαία Ωστική Δύναμη (Unit Impulse) Απόκριση σε Δυνάμεις Αυθαίρετα Μεταβαλλόμενες με το Χρόνο. Απόστολος Σ.

ΕΘΝΙΚΟ ΚΑΙ ΚΑΠΟΔΙΣΤΡΙΑΚΟ ΠΑΝΕΠΙΣΤΗΜΙΟ ΑΘΗΝΩΝ ΣΧΟΛΗ ΘΕΤΙΚΩΝ ΕΠΙΣΤΗΜΩΝ ΤΜΗΜΑ ΠΛΗΡΟΦΟΡΙΚΗΣ ΚΑΙ ΤΗΛΕΠΙΚΟΙΝΩΝΙΩΝ

Mean bond enthalpy Standard enthalpy of formation Bond N H N N N N H O O O

( y) Partial Differential Equations

ST5224: Advanced Statistical Theory II

k A = [k, k]( )[a 1, a 2 ] = [ka 1,ka 2 ] 4For the division of two intervals of confidence in R +

Review Test 3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

PARTIAL NOTES for 6.1 Trigonometric Identities

10/3/ revolution = 360 = 2 π radians = = x. 2π = x = 360 = : Measures of Angles and Rotations

Μηχανική Μάθηση Hypothesis Testing

F-TF Sum and Difference angle

Transcript:

Σύνθετα Δίκτυα com+plex: with+ -fold (having parts) Διδάζκων Δημήηριος Καηζαρός Διάλεξη 09η: 31/03/2016 1

Μεηρικές κενηρικόηηηας Centrality measures 2

Ασκήσεις Άσκηση 1. Να βρεθεί η closeness centrality των κόμβων του διπλανού γραφήματος. Άσκηση 2. Να υπολογίσετε τις τιμές του SPBC για κάθε κόμβο του παρακάτω σχήματος. Στον πίνακα που έπεται δίνονται όλα τα συντομότερα μονοπάτια. 3

Ασκήσεις 4

Ασκήσεις Άσκηση 3. Να βρεθεί το minimum controlling set της PageRank centrality κάθε κόμβου του παρακάτω γραφήματος. 5

Communities in Complex Networks Κοινόηηηες ζε Σύνθεηα Δίκησα 6

Τι είναι μια Web community; 7

Υπάρχουν κοινότητες στις δημοφιλείς Web εφαρμογές; 8

Web communities in more formal terms Communities on the Web are groups of individuals who share a common interest, together with the Web pages most popular amongst them explicit or implicit? Web community is a set of Web-based objects (documents and users) that has its own logical and semantic structures, such that information retrieval and Webdata management is facilitated. e.g. a Web page set with clusters in it is a community; web pages in a set that are related to a given Web page also form a community; Communities are groups of vertices which probably share common properties and/or play similar roles within the graph, e.g. groups of Web pages dealing with related topics 9

two different types of communities easily identified explicitly-defined communities well-known group of web pages sharing a common interest e.g. Yahoo eg. textbooks books novels (graph) analysis required implicitly-defined communities non-obvious; hidden or unexpected;larger and outnumber the explicit ones; may appear as an emerging Web community for some specific topic or event; Computer science History eg. group of web pages for mediterranean cooking 10

in other terms managerially coordinated communities: they have a central authority or process (e.g. a creator) that governs the formation mechanism. e.g. Google groups, LinkedIn, Drupal groups, Facebook, etc self-organized communities: they emerge from the interaction patterns between the members, i.e. highly related members are identified as a community. e.g. Flickr clusters 11

communities context... typically... communities are defined with reference to some graph (network) which represents a set of entities / objects (nodes) and their relations (edges).... even when there is no explicit graph, one can infer it, e.g.: feature vectors distances threshold application graph Given a graph, a community is loosely defined as a set of nodes that are more densely connected to each other than to the rest of the graph vertices. 12

A simple example... extremely profound community structure. inter-community edge key-concepts : withincommunity nodes, intracommunity edges, intercommunity edges. intra-community edge rarely appearing in real systems. Definition of communities is heavily dependent on graph properties and subgraphs discovery 13

Edge centrality & betweenness edge betweenness is the number of shortest paths between pairs of nodes that run along this edge. If there is more than one shortest path between a pair of nodes, each path is assigned equal weight such that the total weight of all of the paths is equal to unity. edges with the maximum score are considered as important for keeping the graph interconnected i.e. their removal results in unconnected clusters. High scoring edges are the weak ties or bottlenecks for traffic moving about the network source : The igraph library project. 14

Subgraph structures (I) k-clique: maximal subgraph each vertex of which is adjacent to all others. k- clique problem is in class NP, i.e. there is no solution to finding cliques in polynomial time, but if a clique is found in the graph, it can be verified in polynomial time. k=3 (triangle) k=4 k=5 too strong for many purposes N-clique: maximal subgraph in which the maximum distance between two vertices is N, i.e. N stands for the length of the path allowed to make a connection to all other members (e.g. if path distance is 2 this corresponds to being "a friend of a friend. ) more helpful and general N=2 (star) 15

Subgraph structures (II) k-core: maximal subgraph in which each vertex is connected to at least k other vertices. k-core is computed by pruning all the vertices (with their respective edges) with degree less than k. this operation can be useful to filter or to study some properties of the graphs 3-core 1-core 2-cores 0-core k-plex: maximal subgraph in which each vertex is connected to all vertices except at most k of them (complementary to k-core) k-core approach is more relaxed, by including vertices once they are connected to k members, regardless of how many other members they may not be connected to. 16

Specific graphs/subgraphs (II) complete bipartite graph or biclique: a special kind of bipartite graph where every vertex of the first set is connected to every vertex of the second set. Bipartite core : a complete bipartite subgraph with at least i nodes from U and at least j nodes from V i and j are tunable parameters A (i=3, j=3) bipartite core 17

The notion of graph cut Given two vertices u and v of graph G(V,E) a cut is a set of edges C E such that the two vertices become separated on the induced graph G (V, E \ C) denoting a source terminal as s and a sink terminal as t, a cut (S, T) of G(V,E) is a partition of V into S and T = V \ S, such that s S and t T Max-flow - min-cut theorem originally proposed by Ford and Fulkerson (1956) proves that maximum flow of the network is identical to minimum cut that separates s and t 18

The role of the flows Definition (s-t Maximum Flow): given a directed graph, G=(V,E), with edge capacities c(u,v) 0, and two vertices, s, t V, find the maximum flow that can be routed from the source, s, to the sink, t Intuition: think of water pipes Note: maximum flow = minimum cut. Maximum flow yields communities 4/4 1 5/9 4/5 2 4 0/8 2/2 3 5 3/3 1/1 3/3 4/9 6 19

Ορισμοί κοινοτήτων: Κατηγοριοποίηση community structure context Graph / linkage spectral flow user Web 2.0 overlapping nonoverlapping 20

Emphasis on linkage density communities are subsets of vertices within which vertex-vertex connections are dense, but between which connections are less dense [Girvan02], [Newman04c]. communities in the Web might represent pages on related topics. Communities are groups of vertices which probably share common properties and/or play similar roles within the graph. [Fortunato07a]. communities may correspond to groups of pages of the World Wide Web dealing with related topics 21

Cores & communities A bipartite core is the identity of a community and to extract all the communities is to enumerate all the bipartite cores on the web A web community is a (μ,ε)-core which is defined starting from a core vertex if its ε-neighborhood contains at least μ vertices A cluster consists of all vertices that belong to a core s neighborhood hubs: isolated vertices connected to more than one clusters w.r.t. ε and μ. outliers: isolated vertices connected to only one (μ,ε)-core. 13 8 9 7 12 11 6 10 3 4 0 5 2 1 22

Web communities through bipartite graphs Dense bipartite graph Complete bipartite graph A Web community is a set of pages that form a dense bipartite graph A community is defined as a set of web pages whose hyperlinks form a complete bipartite graph since a complete bipartite graph abstraction extracts a small set of potential members to agree on some common interests 23

Ορισμοί κοινοτήτων: Κατηγοριοποίηση community structure context Graph / linkage spectral flow user Web 2.0 overlapping nonoverlapping 24

Επικαλυπτόμενες κοινότητες majority of community definitions: driven by crisp clustering why not consider non-crisp clustering? i.e., a vertex may belong to multiple communities it better fits human intuition still, does not require any text processing since communities are not disjoint, different sets of communities are solutions scalability to graphs with millions of edges is the challenge in many cases: necessary in order to produce reasonable results (see next slide) 25

Επικάλυψη σε κοινότητες Different communities are rendered in different colors. Edges between communities are colored in light gray. Overlapping regions between communities are emphasized in red 26

Υπολογισμός της edge betwenness 27

Αλγόριθμος των Girvan-Newman (GN) 1. Υπολογισμός της betweenness για όλες τις ακμές του δικτύου 2. Αφαίρεση της ακμής με την μεγαλύτερη betweenness 3. Επανυπολογισμός της betweennesses για όλες τις ακμές, οι οποίες επηρεάζονται από την αφαίρεση 4. Επανάληψη από το Βήμα 2, μέχρι να εξαλειφθούν όλες οι ακμές 28

Ανάλυση του αλγορίθμου των GN To improve performance, the betweennesses can be calculated by using the fast algorithm of Newman, which calculates betweenness for all m edges in a graph of n vertices in time O(mn) Because this calculation has to be repeated once for the removal of each-edge, the entire algorithm runs in worstcase time O (m^2 n) However, after the removal of each edge, we only have to recalculate the betweennesses of those edges that were affected by the removal, which is at most only those in the same component as the removed edge. This means that running time may be better than worst-case for networks with strong community structure 29

Alternative improvements To try to reduce the running time of the algorithm further, one might be tempted to calculate the betweennesses of all edges only once and then remove them in order of decreasing betweenness However, It is found that this strategy does not work well, because if two communities are connected by more than one edge, then there is no guarantee that all of those edges will have high betweenness we only know that at least one of them will 30

Ισχυρές κοινότητες community in a strong sense each node in a strong community has more connections within (intra-) the community than with the rest (inter-) of the graph k in ( ) out i V ki, i V Red-colored nodes do not belong to any community 31

Ασθενείς κοινότητες community in a weak sense in a weak community the sum of all degrees within V is larger than sum of all degrees toward the rest of the network i V k in ( V ) k out, i V i i i V Red-colored node belongs to the community, even though its outdegree > indegree 32

Ορισμός για ασθενέστερο από τον ισχυρό ορισμό Κοινότητα με πιο `ασθενή ορισμό each node s degree inside the weaker than the strong community should not be smaller than the node s degree toward any other community Εάν V 1, V 2,, V m είναι m communities του G, V k, k=1,2,,m, πρέπει να ισχύει: και Red-colored node belongs to this community (or it could belong to the upper community) 2 33