What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 樗?錚豆?弔? | 1001001010010100001111111110100001000010100100111010010000111111100100101010001000111111 | 92943fe84293a43f92a23f
EUC-JP | 樗?錚豆?弔? | 1100001111110100001111111110111110100011110001101010011000111111110001001010010000111111 | c3f43fefa3c6a63fc4a43f
UTF-8 | 樗렜錚豆땐弔렍 | 111001101010100010010111111010111010000010011100111010011000110010011010111010001011000110000110111010111001010110010000111001011011110010010100111010111010000010001101 | e6a897eba09ce98c9ae8b186eb9590e5bc94eba08d
UHC | 樗렜錚豆땐弔렍 | 1110111011000000100011101010111011101110101101101101010011100111101101101010100111110000110000001000111010100011 | eec08eaeeeb6d4e7b6a9f0c08ea3
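
A table like the one above can be approximated with a short Python sketch. This is not the converter's own code: the function name `encode_table` and the charset-to-codec mapping are assumptions made for illustration. In particular, Python's `cp932` and `cp949` codecs are assumed to correspond to SJIS-WIN and UHC, and their output may differ from the tool's for some characters. Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches the pattern visible in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "latin-1"),
    ("SJIS-WIN", "cp932"),   # assumption: cp932 stands in for SJIS-WIN
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # assumption: cp949 stands in for UHC
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, round-tripped text, binary string, hex string) per charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, raw.decode(codec), bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexed in encode_table("あ"):
        print(label, shown, bits, hexed, sep=" | ")
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.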

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).