To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????Q???T 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101000100111111001111110011111101010100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f513f3f3f54
SJIS-WIN 巐わ乘巐ゎァ呷嶸ゥ巐ゎァ呷嶸ェQ巐わ亂T 1111101010110110100000101110110110011000101010011111101010110110100000101110110010100111100110011110110011111000100000101111101010110100101010011111101010110110100000101110110010100111100110011110110011111000100000101111101010110100101010100101000111111010101101101000001011101101100110001010101001010100 fab682ed98a9fab682eca799ecf882fab4a9fab682eca799ecf882fab4aa51fab682ed98aa54
EUC-JP 巐わ乘巐ゎァ呷?嶸ゥ巐ゎァ呷?嶸ェQ巐わ亂T 10001111101110111111100110100100111011111101000010101011100011111011101111111001101001001110111010001110101001111101001011101110001111111000111110111011111101001000111010101001100011111011101111111001101001001110111010001110101001111101001011101110001111111000111110111011111101001000111010101010010100011000111110111011111110011010010011101111110100001010110001010100 8fbbf9a4efd0ab8fbbf9a4ee8ea7d2ee3f8fbbf48ea98fbbf9a4ee8ea7d2ee3f8fbbf48eaa518fbbf9a4efd0ac54
UTF-8 巐わ乘巐ゎァ呷嶸ゥ巐ゎァ呷嶸ェQ巐わ亂T 1110010110110111100100001110001110000010100011111110010010111001100110001110010110110111100100001110001110000010100011101110111110111101101001111110010110010001101101111110111010011000101000011110010110110110101110001110111110111101101010011110010110110111100100001110001110000010100011101110111110111101101001111110010110010001101101111110111010011000101000011110010110110110101110001110111110111101101010100101000111100101101101111001000011100011100000101000111111100100101110101000001001010100 e5b790e3828fe4b998e5b790e3828eefbda7e591b7ee98a1e5b6b8efbda9e5b790e3828eefbda7e591b7ee98a1e5b6b8efbdaa51e5b790e3828fe4ba8254
UHC ?わ乘?ゎ???嶸??ゎ???嶸?Q?わ亂T 001111111010101011101111111000111010101100111111101010101110111000111111001111110011111111100111101011100011111100111111101010101110111000111111001111110011111111100111101011100011111101010001001111111010101011101111110101011010111101010100 3faaefe3ab3faaee3f3f3fe7ae3f3faaee3f3f3fe7ae3f513faaefd5af54

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)