What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??]nkf??]nk^}Y??]nkf??]nk^}bE 0011111100111111010111010110111001101011011001100011111100111111010111010110111001101011010111100111110101011001001111110011111101011101011011100110101101100110001111110011111101011101011011100110101101011110011111010110001001000101 3f3f5d6e6b663f3f5d6e6b5e7d593f3f5d6e6b663f3f5d6e6b5e7d6245
SJIS-WIN 虞?]nkf虞?]nk^}Y虞?]nkf虞?]nk^}bE 100010111111000100111111010111010110111001101011011001101000101111110001001111110101110101101110011010110101111001111101010110011000101111110001001111110101110101101110011010110110011010001011111100010011111101011101011011100110101101011110011111010110001001000101 8bf13f5d6e6b668bf13f5d6e6b5e7d598bf13f5d6e6b668bf13f5d6e6b5e7d6245
EUC-JP 虞?]nkf虞?]nk^}Y虞?]nkf虞?]nk^}bE 101101101111001100111111010111010110111001101011011001101011011011110011001111110101110101101110011010110101111001111101010110011011011011110011001111110101110101101110011010110110011010110110111100110011111101011101011011100110101101011110011111010110001001000101 b6f33f5d6e6b66b6f33f5d6e6b5e7d59b6f33f5d6e6b66b6f33f5d6e6b5e7d6245
UTF-8 虞록]nkf虞록]nk^}Y虞록]nkf虞록]nk^}bE 111010001001100110011110111010111010000110011101010111010110111001101011011001101110100010011001100111101110101110100001100111010101110101101110011010110101111001111101010110011110100010011001100111101110101110100001100111010101110101101110011010110110011011101000100110011001111011101011101000011001110101011101011011100110101101011110011111010110001001000101 e8999eeba19d5d6e6b66e8999eeba19d5d6e6b5e7d59e8999eeba19d5d6e6b66e8999eeba19d5d6e6b5e7d6245
UHC 虞록]nkf虞록]nk^}Y虞록]nkf虞록]nk^}bE 11101001111001011011011111001111010111010110111001101011011001101110100111100101101101111100111101011101011011100110101101011110011111010101100111101001111001011011011111001111010111010110111001101011011001101110100111100101101101111100111101011101011011100110101101011110011111010110001001000101 e9e5b7cf5d6e6b66e9e5b7cf5d6e6b5e7d59e9e5b7cf5d6e6b66e9e5b7cf5d6e6b5e7d6245

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
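
The conversion shown in the table can be roughly reproduced with a short Python sketch. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption of this example, as is the helper name `encode_row`. The page itself may use a different conversion library, so results for edge cases may differ. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes visible in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These names are illustrative choices, not taken from the original page.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Return (rendered text, binary string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Decode again to show how the text looks after the round trip.
    rendered = raw.decode(codec)
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return rendered, bits, raw.hex()


if __name__ == "__main__":
    # Print one space-separated row per charset, like the table above.
    for label, codec in CODECS.items():
        rendered, bits, hexed = encode_row("虞록", codec)
        print(label, rendered, bits, hexed)
```

For example, `encode_row("虞", "utf-8")` yields the hex string `e8999e`, matching the first three bytes of the UTF-8 row.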