To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 遯堤粟W^遯堤粟\}v遯堤粟W^遯堤粟\}vB 1110011110101010100100101110011110001000101111100101011101011110111001111010101010010010111001111000100010111110010111000111110101110110111001111010101010010010111001111000100010111110010101110101111011100111101010101001001011100111100010001011111001011100011111010111011001000010 e7aa92e788be575ee7aa92e788be5c7d76e7aa92e788be575ee7aa92e788be5c7d7642
EUC-JP 遯堤粟W^遯堤粟\}v遯堤粟W^遯堤粟\}vB 1110111010101100110001001110100110110000110000000101011101011110111011101010110011000100111010011011000011000000010111000111110101110110111011101010110011000100111010011011000011000000010101110101111011101110101011001100010011101001101100001100000001011100011111010111011001000010 eeacc4e9b0c0575eeeacc4e9b0c05c7d76eeacc4e9b0c0575eeeacc4e9b0c05c7d7642
UTF-8 遯堤粟W^遯堤粟\}v遯堤粟W^遯堤粟\}vB 1110100110000001101011111110010110100000101001001110011110110010100111110101011101011110111010011000000110101111111001011010000010100100111001111011001010011111010111000111110101110110111010011000000110101111111001011010000010100100111001111011001010011111010101110101111011101001100000011010111111100101101000001010010011100111101100101001111101011100011111010111011001000010 e981afe5a0a4e7b29f575ee981afe5a0a4e7b29f5c7d76e981afe5a0a4e7b29f575ee981afe5a0a4e7b29f5c7d7642
UHC 遯堤粟W^遯堤粟\}v遯堤粟W^遯堤粟\}vB 1101010011101110111100001010011111100001110110000101011101011110110101001110111011110000101001111110000111011000010111000111110101110110110101001110111011110000101001111110000111011000010101110101111011010100111011101111000010100111111000011101100001011100011111010111011001000010 d4eef0a7e1d8575ed4eef0a7e1d85c7d76d4eef0a7e1d8575ed4eef0a7e1d85c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)