To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 玉??獄??堊↑イ臆 10001011110010100011111100111111100011011001011000111111001111111001101010111111100000011010101010000011010000111000100110110000 8bca3f3f8d963f3f9abf81aa834389b0
EUC-JP 玉??獄??堊↑イ臆 10110110110011000011111100111111101110011111011000111111001111111101010011000001101000101010110010100101101001001011001010110010 b6cc3f3fb9f63f3fd4c1a2aca5a4b2b2
UTF-8 玉뉛슴獄ㅴ퉿堊↑イ臆 111001111000111010001001111010111000100110011011111011001000101010110100111001111000110110000100111000111000010110110100111011011000100110111111111001011010000010001010111000101000011010010001111000111000001010100100111010001000011110000110 e78e89eb899bec8ab4e78d84e385b4ed89bfe5a08ae28691e382a4e88786
UHC 玉뉛슴獄ㅴ퉿堊↑イ臆 1110100010101100100001111110111110111101101111111110100010101011101001001110010010111001100101111110010010111110101000011110100010101011101001001110010111100110 e8ac87efbdbfe8aba4e4b997e4bea1e8aba4e5e6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)