To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???竊??筍ル???????竊??筍ル????^ 00111111001111110011111111100010100001100011111100111111111000101010000110000011100010110011111100111111001111110011111100111111001111110011111111100010100001100011111100111111111000101010000110000011100010110011111100111111001111110011111101011110 3f3f3fe2863f3fe2a1838b3f3f3f3f3f3f3fe2863f3fe2a1838b3f3f3f3f5e
EUC-JP ???竊??筍ル?孼?????竊??筍ル?孼??^ 0011111100111111001111111110001111100110001111110011111111100100101000111010010111101011001111111000111110111010110000110011111100111111001111110011111100111111111000111110011000111111001111111110010010100011101001011110101100111111100011111011101011000011001111110011111101011110 3f3f3fe3e63f3fe4a3a5eb3f8fbac33f3f3f3f3fe3e63f3fe4a3a5eb3f8fbac33f3f5e
UTF-8 僚녹뼔竊섊댚筍ル븶孼뽮킋僚녹뼔竊섊댚筍ル븶孼뽮킋^ 11101111101001101011101111101011100001011011100111101011101111001001010011100111101010111000101011101100100001001000101011101011100011001001101011100111101011011000110111100011100000111010101111101011101110001011011011100101101011011011110011101011101111011010111011101101100000101000101111101111101001101011101111101011100001011011100111101011101111001001010011100111101010111000101011101100100001001000101011101011100011001001101011100111101011011000110111100011100000111010101111101011101110001011011011100101101011011011110011101011101111011010111011101101100000101000101101011110 efa6bbeb85b9ebbc94e7ab8aec848aeb8c9ae7ad8de383abebb8b6e5adbcebbdaeed828befa6bbeb85b9ebbc94e7ab8aec848aeb8c9ae7ad8de383abebb8b6e5adbcebbdaeed828b5e
UHC 僚녹뼔竊섊댚筍ル븶孼뽮킋僚녹뼔竊섊댚筍ル븶孼뽮킋^ 11101000111010001011001111101100100101101001110011101111101111001001100011100111100010001011111011100010111011001010101111101011100101011001111111100101111011011001011011101010101101001001011111101000111010001011001111101100100101101001110011101111101111001001100011100111100010001011111011100010111011001010101111101011100101011001111111100101111011011001011011101010101101001001011101011110 e8e8b3ec969cefbc98e788bee2ecabeb959fe5ed96eab497e8e8b3ec969cefbc98e788bee2ecabeb959fe5ed96eab4975e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)