What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??nTkf??nTk^}Y??nTkf??nTk^}bE 0011111100111111011011100101010001101011011001100011111100111111011011100101010001101011010111100111110101011001001111110011111101101110010101000110101101100110001111110011111101101110010101000110101101011110011111010110001001000101 3f3f6e546b663f3f6e546b5e7d593f3f6e546b663f3f6e546b5e7d6245
SJIS-WIN 澈蒔nTkf澈蒔nTk^}Y澈蒔nTkf澈蒔nTk^}bE 11111011010010111000111010101010011011100101010001101011011001101111101101001011100011101010101001101110010101000110101101011110011111010101100111111011010010111000111010101010011011100101010001101011011001101111101101001011100011101010101001101110010101000110101101011110011111010110001001000101 fb4b8eaa6e546b66fb4b8eaa6e546b5e7d59fb4b8eaa6e546b66fb4b8eaa6e546b5e7d6245
EUC-JP 澈蒔nTkf澈蒔nTk^}Y澈蒔nTkf澈蒔nTk^}bE 1000111111001000111001011011110010101100011011100101010001101011011001101000111111001000111001011011110010101100011011100101010001101011010111100111110101011001100011111100100011100101101111001010110001101110010101000110101101100110100011111100100011100101101111001010110001101110010101000110101101011110011111010110001001000101 8fc8e5bcac6e546b668fc8e5bcac6e546b5e7d598fc8e5bcac6e546b668fc8e5bcac6e546b5e7d6245
UTF-8 澈蒔nTkf澈蒔nTk^}Y澈蒔nTkf澈蒔nTk^}bE 111001101011111010001000111010001001001010010100011011100101010001101011011001101110011010111110100010001110100010010010100101000110111001010100011010110101111001111101010110011110011010111110100010001110100010010010100101000110111001010100011010110110011011100110101111101000100011101000100100101001010001101110010101000110101101011110011111010110001001000101 e6be88e892946e546b66e6be88e892946e546b5e7d59e6be88e892946e546b66e6be88e892946e546b5e7d6245
UHC 澈蒔nTkf澈蒔nTk^}Y澈蒔nTkf澈蒔nTk^}bE 11110100110011011110001111001000011011100101010001101011011001101111010011001101111000111100100001101110010101000110101101011110011111010101100111110100110011011110001111001000011011100101010001101011011001101111010011001101111000111100100001101110010101000110101101011110011111010110001001000101 f4cde3c86e546b66f4cde3c86e546b5e7d59f4cde3c86e546b66f4cde3c86e546b5e7d6245

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
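
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own implementation. The function names and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions made for illustration, and Python's codec tables may differ from the converter's for vendor-extension characters. Characters that a charset cannot represent are replaced with "?", which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset of the table, keyed by charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("蒔").items():
        print(label, binary, hexadecimal)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.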