Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??一?ぜ遺?? 11101001111100100011111100111111100010001110101000111111100000101011101010001000111000100011111100111111 e9f23f3f88ea3f82ba88e23f3f
EUC-JP 鶯??一?ぜ遺?? 11110010111101000011111100111111101100001110110000111111101001001011110010110000111001000011111100111111 f2f43f3fb0ec3fa4bcb0e43f3f
UTF-8 鶯볤쑬一룩ぜ遺쇄봼 111010011011011010101111111010111011001110100100111011001001000110101100111001001011100010000000111010111010001110101001111000111000000110011100111010011000000110111010111011001000011110000100111010111011010010111100 e9b6afebb3a4ec91ace4b880eba3a9e3819ce981baec8784ebb4bc
UHC 鶯볤쑬一룩ぜ遺쇄봼 111001011010001110010011111010101011111010101000111011001110100110110111111010001010101010111100111010111011011010111100111000101001010010000011 e5a393eabea8ece9b7e8aabcebb6bce29483

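SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively. A character that a charset cannot represent appears as "?" (byte 3f) in that charset's row.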
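
The rows of the table can be reproduced approximately with a short Python sketch. It encodes the input in each charset, substitutes "?" for any character the charset cannot represent, and prints the bytes as a binary string and a hexadecimal string. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption; the tool behind this page may use slightly different conversion tables, so some characters can come out differently. The names CHARSETS, encode_row and encode_table are hypothetical helpers introduced only for this illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("一").items():
        print(label, bits, hex_string)
```

For the single character 一, this yields e4b880 in UTF-8, 88ea in SJIS-WIN, b0ec in EUC-JP, and 3f in ISO-8859-1, which matches the corresponding byte groups in the table above.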