To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨籠刀梓?s砥?雋梨淨籠刀梓?s砥?雋悧^ 10011111110001001110001011000100100100111000000110001000101100100011111110000010100100111001001101110101001111111110100010110010100101111001110010011111110001001110001011000100100100111000000110001000101100100011111110000010100100111001001101110101001111111110100010110010100111001010010001011110 9fc4e2c4938188b23f829393753fe8b2979c9fc4e2c4938188b23f829393753fe8b29ca45e
EUC-JP 淨籠刀梓?s砥?雋梨淨籠刀梓?s砥?雋悧^ 11011110110001101110010011000110110001011110000110110000101101000011111110100011111100111100010111010110001111111111000010110100110011011111110011011110110001101110010011000110110001011110000110110000101101000011111110100011111100111100010111010110001111111111000010110100110110001010011001011110 dec6e4c6c5e1b0b43fa3f3c5d63ff0b4cdfcdec6e4c6c5e1b0b43fa3f3c5d63ff0b4d8a65e
UTF-8 淨籠刀梓띔s砥렫雋梨淨籠刀梓띔s砥렫雋悧^ 11100110101101111010100011100111101100011010000011100101100010001000000011100110101000101001001111101011100111011001010011101111101111011001001111100111101000001010010111101011101000001010101111101001100110111000101111100110101000101010100011100110101101111010100011100111101100011010000011100101100010001000000011100110101000101001001111101011100111011001010011101111101111011001001111100111101000001010010111101011101000001010101111101001100110111000101111100110100000101010011101011110 e6b7a8e7b1a0e58880e6a293eb9d94efbd93e7a0a5eba0abe99b8be6a2a8e6b7a8e7b1a0e58880e6a293eb9d94efbd93e7a0a5eba0abe99b8be682a75e
UHC 淨籠刀梓띔s砥렫雋梨淨籠刀梓띔s砥렫雋悧^ 1110111111100100110101101110101111010011111011111110111010101001101101101110101010100011111100111111001010110010100011101011100111110001111001101101011111011110111011111110010011010110111010111101001111101111111011101010100110110110111010101010001111110011111100101011001010001110101110011111000111100110110101111101110001011110 efe4d6ebd3efeea9b6eaa3f3f2b28eb9f1e6d7deefe4d6ebd3efeea9b6eaa3f3f2b28eb9f1e6d7dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)