What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨籠刀臍??憶?雋梨淨籠刀臍??憶?雋悧^ 1001111111000100111000101100010010010011100000011110010001100000001111110011111110001001101011110011111111101000101100101001011110011100100111111100010011100010110001001001001110000001111001000110000000111111001111111000100110101111001111111110100010110010100111001010010001011110 9fc4e2c49381e4603f3f89af3fe8b2979c9fc4e2c49381e4603f3f89af3fe8b29ca45e
EUC-JP 淨籠刀臍??憶?雋梨淨籠刀臍??憶?雋悧^ 1101111011000110111001001100011011000101111000011110011111000001001111110011111110110010101100010011111111110000101101001100110111111100110111101100011011100100110001101100010111100001111001111100000100111111001111111011001010110001001111111111000010110100110110001010011001011110 dec6e4c6c5e1e7c13f3fb2b13ff0b4cdfcdec6e4c6c5e1e7c13f3fb2b13ff0b4d8a65e
UTF-8 淨籠刀臍잴릎憶렞雋梨淨籠刀臍잴릎憶렞雋悧^ 11100110101101111010100011100111101100011010000011100101100010001000000011101000100001111000110111101100100111101011010011101011101001101000111011100110100001101011011011101011101000001001111011101001100110111000101111100110101000101010100011100110101101111010100011100111101100011010000011100101100010001000000011101000100001111000110111101100100111101011010011101011101001101000111011100110100001101011011011101011101000001001111011101001100110111000101111100110100000101010011101011110 e6b7a8e7b1a0e58880e8878dec9eb4eba68ee686b6eba09ee99b8be6a2a8e6b7a8e7b1a0e58880e8878dec9eb4eba68ee686b6eba09ee99b8be682a75e
UHC 淨籠刀臍잴릎憶렞雋梨淨籠刀臍잴릎憶렞雋悧^ 1110111111100100110101101110101111010011111011111111000010110000110000001110101010111000101011011110010111100011100011101010111111110001111001101101011111011110111011111110010011010110111010111101001111101111111100001011000011000000111010101011100010101101111001011110001110001110101011111111000111100110110101111101110001011110 efe4d6ebd3eff0b0c0eab8ade5e38eaff1e6d7deefe4d6ebd3eff0b0c0eab8ade5e38eaff1e6d7dc5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
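
A conversion like the one in the table above can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page. The function name `bit_strings` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Hypothetical helper: errors="replace" substitutes "?" (0x3f) for any
    character the codec cannot encode instead of raising an exception.
    """
    data = text.encode(codec, errors="replace")
    # Render every byte as 8 bits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a two-character sample input.
    for label, codec in CODECS.items():
        binary, hexadecimal = bit_strings("淨^", codec)
        print(label, binary, hexadecimal)
```

For example, `bit_strings("淨", "utf-8")` returns the hexadecimal string `e6b7a8`, the same three bytes that open the UTF-8 row of the table.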