To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????W}????????W{^ 001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 堤?旭?原蝎??W}堤?旭?原蝎??W{^ 1001001011100111001111111000100010101110001111111000110010110100111001011001100100111111001111110101011101111101100100101110011100111111100010001010111000111111100011001011010011100101100110010011111100111111010101110111101101011110 92e73f88ae3f8cb4e5993f3f577d92e73f88ae3f8cb4e5993f3f577b5e
EUC-JP 堤?旭?原蝎??W}堤?旭?原蝎??W{^ 1100010011101001001111111011000010110000001111111011100010110110111010011111100100111111001111110101011101111101110001001110100100111111101100001011000000111111101110001011011011101001111110010011111100111111010101110111101101011110 c4e93fb0b03fb8b6e9f93f3f577dc4e93fb0b03fb8b6e9f93f3f577b5e
UTF-8 堤렚旭렏原蝎렱렡W}堤렚旭렏原蝎렱렡W{^ 1110010110100000101001001110101110100000100110101110011010010111101011011110101110100000100011111110010110001110100111111110100010011101100011101110101110100000101100011110101110100000101000010101011101111101111001011010000010100100111010111010000010011010111001101001011110101101111010111010000010001111111001011000111010011111111010001001110110001110111010111010000010110001111010111010000010100001010101110111101101011110 e5a0a4eba09ae697adeba08fe58e9fe89d8eeba0b1eba0a1577de5a0a4eba09ae697adeba08fe58e9fe89d8eeba0b1eba0a1577b5e
UHC 堤렚旭렏原蝎렱렡W}堤렚旭렏原蝎렱렡W{^ 11110000101001111000111010101101111010011110111110001110101001011110101010101011110010101110100110001110101111101000111010110010010101110111110111110000101001111000111010101101111010011110111110001110101001011110101010101011110010101110100110001110101111101000111010110010010101110111101101011110 f0a78eade9ef8ea5eaabcae98ebe8eb2577df0a78eade9ef8ea5eaabcae98ebe8eb2577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)