To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 樗??蠢?堤? 10010010100101000011111100111111111001011011111100111111100100101110011100111111 92943f3fe5bf3f92e73f
EUC-JP 樗??蠢?堤? 11000011111101000011111100111111111010101100000100111111110001001110100100111111 c3f43f3feac13fc4e93f
UTF-8 樗곌짊蠢렊堤렩 111001101010100010010111111010101011001110001100111011001010011110001010111010001010000010100010111010111010000010001010111001011010000010100100111010111010000010101001 e6a897eab38ceca78ae8a0a2eba08ae5a0a4eba0a9
UHC 樗곌짊蠢렊堤렩 1110111011000000101100001110101011000001111110111111000111100011100011101010000111110000101001111000111010110111 eec0b0eac1fbf1e38ea1f0a78eb7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)