To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 莨粋扱W^莨粋扱\}v莨粋扱W^莨粋扱\}vB 1110010010111100100100001000100010001000101101010101011101011110111001001011110010010000100010001000100010110101010111000111110101110110111001001011110010010000100010001000100010110101010101110101111011100100101111001001000010001000100010001011010101011100011111010111011001000010 e4bc908888b5575ee4bc908888b55c7d76e4bc908888b5575ee4bc908888b55c7d7642
EUC-JP 莨粋扱W^莨粋扱\}v莨粋扱W^莨粋扱\}vB 1110100010111110101111111110100010110000101101110101011101011110111010001011111010111111111010001011000010110111010111000111110101110110111010001011111010111111111010001011000010110111010101110101111011101000101111101011111111101000101100001011011101011100011111010111011001000010 e8bebfe8b0b7575ee8bebfe8b0b75c7d76e8bebfe8b0b7575ee8bebfe8b0b75c7d7642
UTF-8 莨粋扱W^莨粋扱\}v莨粋扱W^莨粋扱\}vB 1110100010001110101010001110011110110010100010111110011010001001101100010101011101011110111010001000111010101000111001111011001010001011111001101000100110110001010111000111110101110110111010001000111010101000111001111011001010001011111001101000100110110001010101110101111011101000100011101010100011100111101100101000101111100110100010011011000101011100011111010111011001000010 e88ea8e7b28be689b1575ee88ea8e7b28be689b15c7d76e88ea8e7b28be689b1575ee88ea8e7b28be689b15c7d7642
UHC ??扱W^??扱\}v??扱W^??扱\}vB 001111110011111111010000111000100101011101011110001111110011111111010000111000100101110001111101011101100011111100111111110100001110001001010111010111100011111100111111110100001110001001011100011111010111011001000010 3f3fd0e2575e3f3fd0e25c7d763f3fd0e2575e3f3fd0e25c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)