Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 錞??臆??遜? 1111101111011011001111110011111110001001101100000011111100111111100100011011101100111111 fbdb3f3f89b03f3f91bb3f
EUC-JP 錞??臆??遜? 100011111110010011011100001111110011111110110010101100100011111100111111110000101011110100111111 8fe4dc3f3fb2b23f3fc2bd3f
UTF-8 錞딂둟臆뚥돴遜땵 111010011000110010011110111010111001010010000010111010111001000110011111111010001000011110000110111010111001101010100101111010111000111110110100111010011000000110011100111010111001010110110101 e98c9eeb9482eb919fe88786eb9aa5eb8fb4e9819ceb95b5
UHC 錞딂둟臆뚥돴遜땵 11100010111101101000101011101000100010100101011111100101111001101000110011100100100010011011011111100001111000011000101110001011 e2f68ae88a57e5e68ce489b7e1e18b8b

SJIS-WIN, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
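
A conversion like the one in the table can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page: the mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, the function name encode_report is hypothetical, and Python's codecs may not match this tool byte for byte. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, then concatenate.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hex_string in encode_report("あ"):
        print(label, binary, hex_string)
```

For the hiragana "あ", for instance, the UTF-8 row is the three bytes e3 81 82, while ISO-8859-1 cannot represent the character and yields the single replacement byte 3f.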