To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????h 00111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f68
SJIS-WIN 撓??敖??汚??h 10011101100110100011111100111111100111011100001000111111001111111000100110011000001111110011111101101000 9d9a3f3f9dc23f3f89983f3f68
EUC-JP 撓??敖??汚??h 11011001111110100011111100111111110110101100010000111111001111111011000111111000001111110011111101101000 d9fa3f3fdac43f3fb1f83f3f68
UTF-8 撓뽳슈敖삼슥汚얏썑h 11100110100100101001001111101011101111011011001111101100100010101000100011100110100101011001011011101100100000101011110011101100100010101010010111100110101100011001101011101100100101101000111111101100100011011001000101101000 e69293ebbdb3ec8a88e69596ec82bcec8aa5e6b19aec968fec8d9168
UHC 撓뽳슈敖삼슥汚얏썑h 11101000111101011001011011101111101111011011010011100111111110011011101111101111101111011011101111100111111111011011111011100110100110111000010001101000 e8f596efbdb4e7f9bbefbdbbe7fdbee69b8468

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)