To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 午ョ????泳ц?^ 1000110011011111100000111000011100111111001111110011111100111111100010010110101010000100100010000011111101011110 8cdf83873f3f3f3f896a84883f5e
EUC-JP 午ョ????泳ц?^ 1011100011100001101001011110011100111111001111110011111100111111101100011100101110100111111010000011111101011110 b8e1a5e73f3f3f3fb1cba7e83f5e
UTF-8 午ョ땝療귟룶泳ц럼^ 111001011000110110001000111000111000001110100111111010111001010110011101111011111010011110000001111010101011011110011111111010111010001110110110111001101011001110110011110100011000011011101011100111111011110001011110 e58d88e383a7eb959defa781eab79feba3b6e6b3b3d186eb9fbc5e
UHC 午ョ땝療귟룶泳ц럼^ 11100111111011011010101111100111101101101010110011101000111111101000001011101000100011111010101111100111101101101010110011101000101101111011001101011110 e7edabe7b6ace8fe82e88fabe7b6ace8b7b35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)