What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????L??????????????SB 001111110011111100111111001111110100110000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101001101000010 3f3f3f3f4c3f3f3f3f3f3f3f3f3f3f3f3f3f3f5342
SJIS-WIN 嶸殘嶸ゥL嶸カ嶸ィ嶸ィ巐楠仄巐懍ア被仄SB 11111010101101001001111101101011111110101011010010101001010011001111101010110100101101101111101010110100101010001111101010110100101010001111101010110110100100111110110110011000101110101111101010110110100111001110110010110001100101001110110110011000101110100101001101000010 fab49f6bfab4a94cfab4b6fab4a8fab4a8fab693ed98bafab69cecb194ed98ba5342
EUC-JP 嶸殘嶸ゥL嶸カ嶸ィ嶸ィ巐楠仄巐懍ア被仄SB 10001111101110111111010011011101110011001000111110111011111101001000111010101001010011001000111110111011111101001000111010110110100011111011101111110100100011101010100010001111101110111111010010001110101010001000111110111011111110011100011011101111110100001011110010001111101110111111100111011000111011101000111010110001110010001110111111010000101111000101001101000010 8fbbf4ddcc8fbbf48ea94c8fbbf48eb68fbbf48ea88fbbf48ea88fbbf9c6efd0bc8fbbf9d8ee8eb1c8efd0bc5342
UTF-8 嶸殘嶸ゥL嶸カ嶸ィ嶸ィ巐楠仄巐懍ア被仄SB 111001011011011010111000111001101010111010011000111001011011011010111000111011111011110110101001010011001110010110110110101110001110111110111101101101101110010110110110101110001110111110111101101010001110010110110110101110001110111110111101101010001110010110110111100100001110011010100101101000001110010010111011100001001110010110110111100100001110011010000111100011011110111110111101101100011110100010100010101010111110010010111011100001000101001101000010 e5b6b8e6ae98e5b6b8efbda94ce5b6b8efbdb6e5b6b8efbda8e5b6b8efbda8e5b790e6a5a0e4bb84e5b790e6878defbdb1e8a2abe4bb845342
UHC 嶸殘嶸?L嶸?嶸?嶸??楠仄???被仄SB 11100111101011101110110111010001111001111010111000111111010011001110011110101110001111111110011110101110001111111110011110101110001111110011111111010001111110001111011010110001001111110011111100111111111110011010110011110110101100010101001101000010 e7aeedd1e7ae3f4ce7ae3fe7ae3fe7ae3f3fd1f8f6b13f3f3ff9acf6b15342

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
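
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation (which is not shown here). The mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper name `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` runs in the table above; other byte values may differ from the page's output where the codec implementations disagree.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, character, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip.
        shown = raw.decode(codec)
        # Eight bits per byte, zero-padded, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hex_string in encode_table("SB"):
        print(label, shown, bits, hex_string)
```

For the ASCII input `SB`, every charset in the list yields the same two bytes, `5342`, which is why each row of the table ends with that value.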