To what bit string is a character (or several characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壕??麥??湖?霽? 1000110110001000001111110011111111101010011011010011111100111111100011001100111000111111111010001100011100111111 8d883f3fea6d3f3f8cce3fe8c73f
EUC-JP 壕?祜麥??湖?霽? 10111001111010000011111110001111110100001101100011110011110011100011111100111111101110001101000000111111111100001100100100111111 b9e83f8fd0d8f3ce3f3fb8d03ff0c93f
UTF-8 壕렚祜麥렩렍湖렕霽렢 111001011010001110010101111010111010000010011010111001111010010110011100111010011011101010100101111010111010000010101001111010111010000010001101111001101011100110010110111010111010000010010101111010011001110010111101111010111010000010100010 e5a395eba09ae7a59ce9baa5eba0a9eba08de6b996eba095e99cbdeba0a2
UHC 壕렚祜麥렩렍湖렕霽렢 1111101110111101100011101010110111111011110101001101100011101010100011101011011110001110101000111111101111001001100011101010101011110000101110001000111010110011 fbbd8eadfbd4d8ea8eb78ea3fbc98eaaf0b88eb3
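A table like the one above can be approximated with a short Python sketch. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names encode_row and encode_table; the original tool may rely on a different conversion library, so individual bytes can differ from the rows shown. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the behaviour visible in the ISO-8859-1 row.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption made for illustration only.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("壕").items():
        print(label, bits, hex_string)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.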

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).