What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀??萸??諛??? 10001000101000110011111100111111111001001100111000111111001111111110011010000111001111110011111100111111 88a33f3fe4ce3f3fe6873f3f3f
EUC-JP 哀??萸??諛??? 10110000101001010011111100111111111010001101000000111111001111111110101111100111001111110011111100111111 b0a53f3fe8d03f3febe73f3f3f
UTF-8 哀노맩萸욜㎉諛곕쳨銳 111001011001001110000000111010111000010110111000111010111010011110101001111010001001000010111000111011001001101010011100111000111000111010001001111010001010101110011011111010101011001110010101111011001011001110101000111010011000101010110011 e59380eb85b8eba7a9e890b8ec9a9ce38e89e8ab9beab395ecb3a8e98ab3
UHC 哀노맩萸욜㎉諛곕쳨銳 1110010011101110101100111110101110010000101100011110101110101101101111111110011110100111101110111110101110110000101100001110101110101011100011011110011111100101 e4eeb3eb90b1ebadbfe7a7bbebb0b0ebab8de7e5

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
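
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the function names encode_row and convert are assumptions made for illustration. Unencodable characters are replaced with "?" (3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 codec stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text in the given codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Build one (binary, hex) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("哀").items():
        print(label, bits, hexa)
```

For the single character 哀, this gives e59380 in UTF-8, 88a3 in SJIS-WIN, b0a5 in EUC-JP, and 3f in ISO-8859-1, which agrees with the leading bytes of the corresponding rows above.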