To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 冶??魏?┠碎?? 10010110111010000011111100111111111010011011000000111111100001001011010111100001111010100011111100111111 96e83f3fe9b03f84b5e1ea3f3f
EUC-JP 冶??魏?┠碎?? 11001100111010100011111100111111111100101011001000111111101010001011011111100010111011000011111100111111 ccea3f3ff2b23fa8b7e2ec3f3f
UTF-8 冶싢넃魏랃┠碎대퓯 111001011000011010110110111011001000101110100010111010111000010010000011111010011010110110001111111010111001111010000011111000101001010010100000111001111010001010001110111010111000110010000000111011011001001110101111 e586b6ec8ba2eb8483e9ad8feb9e83e294a0e7a28eeb8c80ed93af
UHC 冶싢넃魏랃┠碎대퓯 111001011010011110011010111000101000011010010011111010101110000010001101111011111010011010110111111000011110111110110100111010111011111110010110 e5a79ae28693eae08defa6b7e1efb4ebbf96

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)