To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 獄??節ε?厄? 100011011001011000111111001111111001000011011111100000111100001100111111100101101110111100111111 8d963f3f90df83c33f96ef3f
EUC-JP 獄??節ε?厄? 101110011111011000111111001111111100000011100001101001101100010100111111110011001111000100111111 b9f63f3fc0e1a6c53fccf13f
UTF-8 獄쎽폏節ε퇌厄턔 1110011110001101100001001110110010001110101111011110110110001111100011111110011110101111100000001100111010110101111011011000011110001100111001011000111010000100111011011000010010010100 e78d84ec8ebded8f8fe7af80ceb5ed878ce58e84ed8494
UHC 獄쎽폏節ε퇌厄턔 11101000101010111001101111100100101111001001101011101111101111011010010111100101101101111001110111100100111110001011011001001111 e8ab9be4bc9aefbda5e5b79de4f8b64f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)