To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???q}???q{^ 0011111100111111001111110111000101111101001111110011111100111111011100010111101101011110 3f3f3f717d3f3f3f717b5e
SJIS-WIN 叩尊奪q}叩尊奪q{^ 1001001001000000100100011011100010010010010001000111000101111101100100100100000010010001101110001001001001000100011100010111101101011110 924091b89244717d924091b89244717b5e
EUC-JP 叩尊奪q}叩尊奪q{^ 1100001110100001110000101011101011000011101001010111000101111101110000111010000111000010101110101100001110100101011100010111101101011110 c3a1c2bac3a5717dc3a1c2bac3a5717b5e
UTF-8 叩尊奪q}叩尊奪q{^ 1110010110001111101010011110010110110000100010101110010110100101101010100111000101111101111001011000111110101001111001011011000010001010111001011010010110101010011100010111101101011110 e58fa9e5b08ae5a5aa717de58fa9e5b08ae5a5aa717b5e
UHC 叩尊奪q}叩尊奪q{^ 1100110110110000111100001110111011110111101011000111000101111101110011011011000011110000111011101111011110101100011100010111101101011110 cdb0f0eef7ac717dcdb0f0eef7ac717b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)