To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 偲軸篠叱篠軸偲鹿偲ヤト漆篠セト嫉篠執 1000111011000011100011101011001010001110110000101000111010110110100011101100001010001110101100101000111011000011100011101010110110001110110000111101010011000100100011101011110110001110110000101011111011000100100011101011100110001110110000101000111010110111 8ec38eb28ec28eb68ec28eb28ec38ead8ec3d4c48ebd8ec2bec48eb98ec28eb7
EUC-JP 偲軸篠叱篠軸偲鹿偲ヤト漆篠セト嫉篠執 101111001100010110111100101101001011110011000100101111001011100010111100110001001011110010110100101111001100010110111100101011111011110011000101100011101101010010001110110001001011110010111111101111001100010010001110101111101000111011000100101111001011101110111100110001001011110010111001 bcc5bcb4bcc4bcb8bcc4bcb4bcc5bcafbcc58ed48ec4bcbfbcc48ebe8ec4bcbbbcc4bcb9
UTF-8 偲軸篠叱篠軸偲鹿偲ヤト漆篠セト嫉篠執 111001011000000110110010111010001011101110111000111001111010111110100000111001011000111110110001111001111010111110100000111010001011101110111000111001011000000110110010111010011011100110111111111001011000000110110010111011111011111010010100111011111011111010000100111001101011110010000110111001111010111110100000111011111011110110111110111011111011111010000100111001011010101110001001111001111010111110100000111001011001111110110111 e581b2e8bbb8e7afa0e58fb1e7afa0e8bbb8e581b2e9b9bfe581b2efbe94efbe84e6bc86e7afa0efbdbeefbe84e5ab89e7afa0e59fb7
UHC ?軸篠叱篠軸?鹿???漆篠??嫉篠執 0011111111110101111011101110000111000110111100101110101011100001110001101111010111101110001111111101011011100011001111110011111100111111111101101101010011100001110001100011111100111111111100101110110011100001110001101111001011111011 3ff5eee1c6f2eae1c6f5ee3fd6e33f3f3ff6d4e1c63f3ff2ece1c6f2fb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)