To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 邪粤爵W^邪粤爵\}v邪粤爵W^邪粤爵\}vB 1000111011010111111000101110001110001110110111010101011101011110100011101101011111100010111000111000111011011101010111000111110101110110100011101101011111100010111000111000111011011101010101110101111010001110110101111110001011100011100011101101110101011100011111010111011001000010 8ed7e2e38edd575e8ed7e2e38edd5c7d768ed7e2e38edd575e8ed7e2e38edd5c7d7642
EUC-JP 邪粤爵W^邪粤爵\}v邪粤爵W^邪粤爵\}vB 1011110011011001111001001110010110111100110111110101011101011110101111001101100111100100111001011011110011011111010111000111110101110110101111001101100111100100111001011011110011011111010101110101111010111100110110011110010011100101101111001101111101011100011111010111011001000010 bcd9e4e5bcdf575ebcd9e4e5bcdf5c7d76bcd9e4e5bcdf575ebcd9e4e5bcdf5c7d7642
UTF-8 邪粤爵W^邪粤爵\}v邪粤爵W^邪粤爵\}vB 1110100110000010101010101110011110110010101001001110011110001000101101010101011101011110111010011000001010101010111001111011001010100100111001111000100010110101010111000111110101110110111010011000001010101010111001111011001010100100111001111000100010110101010101110101111011101001100000101010101011100111101100101010010011100111100010001011010101011100011111010111011001000010 e982aae7b2a4e788b5575ee982aae7b2a4e788b55c7d76e982aae7b2a4e788b5575ee982aae7b2a4e788b55c7d7642
UHC 邪?爵W^邪?爵\}v邪?爵W^邪?爵\}vB 11011110111101110011111111101101110010010101011101011110110111101111011100111111111011011100100101011100011111010111011011011110111101110011111111101101110010010101011101011110110111101111011100111111111011011100100101011100011111010111011001000010 def73fedc9575edef73fedc95c7d76def73fedc9575edef73fedc95c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)