What bit string is a character encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 塋ょ?言?ⅹ? 1001101011001000100000101110010100111111100011001011111000111111111110100100100100111111 9ac882e53f8cbe3ffa493f
EUC-JP 塋ょ?言??? 11010100110010101010010011100111001111111011100011000000001111110011111100111111 d4caa4e73fb8c03f3f3f
UTF-8 塋ょ텥言됧ⅹ呂 111001011010000110001011111000111000001010000111111011011000010110100101111010001010100010000000111010111001000010100111111000101000010110111001111011111010011010000000 e5a18be38287ed85a5e8a880eb90a7e285b9efa680
UHC 塋ょ텥言됧ⅹ呂 1110011110101011101010101110011110110110100110101110010111101011100010011110010110100101101010101110010111111011 e7abaae7b69ae5eb89e5a5aae5fb

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
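
The conversion in the table above can be approximated with a short Python sketch. This is not the code behind this page. The helper names `encode_row` and `encode_table` are hypothetical, and mapping the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption, so individual bytes may differ from the table for some characters. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which matches the `3f` bytes in the table rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For the hiragana "あ", for instance, this yields `e38182` in UTF-8 and `82a0` in CP932, while ISO-8859-1 cannot represent it and falls back to `3f`.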