What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 專?┗孺???攀?拙 100110111001001100111111100001001010111110011011011111010011111100111111001111111001110110110011001111111001000011011001 9b933f84af9b7d3f3f3f9db33f90d9
EUC-JP 專?┗孺??邕攀?拙 1101010111110011001111111010100010110001110101011101111000111111001111111000111111100001111011011101101010110101001111111100000011011011 d5f33fa8b1d5de3f3f8fe1eddab53fc0db
UTF-8 專닷┗孺쇨룬邕攀섧拙 111001011011000010001000111010111000101110110111111000101001010010010111111001011010110110111010111011001000011110101000111010111010001110101100111010011000001010010101111001101001010010000000111011001000010010100111111001101000101110011001 e5b088eb8bb7e29497e5adbaec87a8eba3ace98295e69480ec84a7e68b99
UHC 專닷┗孺쇨룬邕攀섧拙 1110111011110110101101001110010110100110101100011110101011101000101111001110101010110111111010011110100010111011110110101110011110111100101101011111000011110000 eef6b4e5a6b1eae8bceab7e9e8bbdae7bcb5f0f0

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (byte 3f) in the table.
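
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_row` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codecs may not agree with this tool byte for byte on every character. Unencodable characters are replaced with "?", as in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (rendered text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Decode again to see what survives the round trip.
    rendered = raw.decode(codec, errors="replace")
    # Eight bits per byte, zero-padded, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return rendered, binary, raw.hex()


if __name__ == "__main__":
    sample = "\u5c08"  # the first character of the example input above
    for label, codec in CODECS.items():
        _, binary, hexa = encode_row(sample, codec)
        print(label, binary, hexa)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.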