To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????T 00111111001111110011111100111111001111110011111100111111001111110011111101010100 3f3f3f3f3f3f3f3f3f54
SJIS-WIN ???陰?????T 0011111100111111001111111000100101000001001111110011111100111111001111110011111101010100 3f3f3f89413f3f3f3f3f54
EUC-JP 縯?ł陰?????T 100011111101010011001011001111111000111110101001110010001011000110100010001111110011111100111111001111110011111101010100 8fd4cb3f8fa9c8b1a23f3f3f3f3f54
UTF-8 縯롫ł陰쏁굨紐뉖굶T 111001111011100010101111111010111010000110101011110001011000001011101001100110011011000011101100100011111000000111101010101101011010100011101111101001111000111111101011100010011001011011101010101101011011011001010100 e7b8afeba1abc582e999b0ec8f81eab5a8efa78feb8996eab5b654
UHC 縯롫ł陰쏁굨紐뉖굶T 11100110111000001000111011101011101010011010100111101011111001001001101111100111100000101000111011101011101010101000011111101011101100011011111001010100 e6e08eeba9a9ebe49be7828eebaa87ebb1be54

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)