To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 冶??魏?┠碎 1001011011101000001111110011111111101001101100000011111110000100101101011110000111101010 96e83f3fe9b03f84b5e1ea
EUC-JP 冶??魏?┠碎 1100110011101010001111110011111111110010101100100011111110101000101101111110001011101100 ccea3f3ff2b23fa8b7e2ec
UTF-8 冶싢넃魏랃┠碎 111001011000011010110110111011001000101110100010111010111000010010000011111010011010110110001111111010111001111010000011111000101001010010100000111001111010001010001110 e586b6ec8ba2eb8483e9ad8feb9e83e294a0e7a28e
UHC 冶싢넃魏랃┠碎 1110010110100111100110101110001010000110100100111110101011100000100011011110111110100110101101111110000111101111 e5a79ae28693eae08defa6b7e1ef

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)