it's got semantic "awareness" (using that term very loosely and not to imply cognition or consciousness). my understanding is that one of the hardest problems in computer science - semantic categorization of data - was partly solved by emergent properties of the network architecture they're using.
I don't really understand it much better than that, but it does appear that over time parts of the network tune themselves to be highly sensitive to very specific semantic concepts.