To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 偲辞偲嫉篠シト叱篠セナワト七偲フナ竺篠シト疾 10001110110000111000111010101011100011101100001110001110101110011000111011000010101111001100010010001110101101101000111011000010101111101100010111011100110001001000111010110101100011101100001111001100110001011000111010110001100011101100001010111100110001001000111010111110 8ec38eab8ec38eb98ec2bcc48eb68ec2bec5dcc48eb58ec3ccc58eb18ec2bcc48ebe
EUC-JP 偲辞偲嫉篠シト叱篠セナワト七偲フナ竺篠シト疾 1011110011000101101111001010110110111100110001011011110010111011101111001100010010001110101111001000111011000100101111001011100010111100110001001000111010111110100011101100010110001110110111001000111011000100101111001011011110111100110001011000111011001100100011101100010110111100101100111011110011000100100011101011110010001110110001001011110011000000 bcc5bcadbcc5bcbbbcc48ebc8ec4bcb8bcc48ebe8ec58edc8ec4bcb7bcc58ecc8ec5bcb3bcc48ebc8ec4bcc0
UTF-8 偲辞偲嫉篠シト叱篠セナワト七偲フナ竺篠シト疾 111001011000000110110010111010001011111010011110111001011000000110110010111001011010101110001001111001111010111110100000111011111011110110111100111011111011111010000100111001011000111110110001111001111010111110100000111011111011110110111110111011111011111010000101111011111011111010011100111011111011111010000100111001001011100010000011111001011000000110110010111011111011111010001100111011111011111010000101111001111010101110111010111001111010111110100000111011111011110110111100111011111011111010000100111001111001011010111110 e581b2e8be9ee581b2e5ab89e7afa0efbdbcefbe84e58fb1e7afa0efbdbeefbe85efbe9cefbe84e4b883e581b2efbe8cefbe85e7abbae7afa0efbdbcefbe84e796be
UHC ???嫉篠??叱篠????七???竺篠??疾 001111110011111100111111111100101110110011100001110001100011111100111111111100101110101011100001110001100011111100111111001111110011111111110110110100100011111100111111001111111111010111100111111000011100011000111111001111111111001011110000 3f3f3ff2ece1c63f3ff2eae1c63f3f3f3ff6d23f3f3ff5e7e1c63f3ff2f0

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
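
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name `encode_table` and the mapping from the charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be how the table above handles them.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, shown text, binary string, hex string) for each charset."""
    rows = []
    for charset, codec in CODECS.items():
        # Unencodable characters become b"?" (0x3f) instead of raising an error.
        raw = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip in this charset.
        shown = raw.decode(codec, errors="replace")
        # Eight bits per byte, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((charset, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_table("あ"):
        print(*row)
```

Each hexadecimal digit corresponds to four bits of the binary column, so the two bit-string columns always describe the same bytes.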