Which bit string does a character (or string of characters) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 竪賊尊誰遜賊竪測即誰遜他竪賊尊誰遜賊竪測即誰遜他^ 10010010010001111001000110101111100100011011100010010010010011101001000110111011100100011010111110010010010001111001000110101010100100011010011010010010010011101001000110111011100100011011110010010010010001111001000110101111100100011011100010010010010011101001000110111011100100011010111110010010010001111001000110101010100100011010011010010010010011101001000110111011100100011011110001011110 924791af91b8924e91bb91af924791aa91a6924e91bb91bc924791af91b8924e91bb91af924791aa91a6924e91bb91bc5e
EUC-JP 竪賊尊誰遜賊竪測即誰遜他竪賊尊誰遜賊竪測即誰遜他^ 11000011101010001100001010110001110000101011101011000011101011111100001010111101110000101011000111000011101010001100001010101100110000101010100011000011101011111100001010111101110000101011111011000011101010001100001010110001110000101011101011000011101011111100001010111101110000101011000111000011101010001100001010101100110000101010100011000011101011111100001010111101110000101011111001011110 c3a8c2b1c2bac3afc2bdc2b1c3a8c2acc2a8c3afc2bdc2bec3a8c2b1c2bac3afc2bdc2b1c3a8c2acc2a8c3afc2bdc2be5e
UTF-8 竪賊尊誰遜賊竪測即誰遜他竪賊尊誰遜賊竪測即誰遜他^ 11100111101010111010101011101000101100111000101011100101101100001000101011101000101010101011000011101001100000011001110011101000101100111000101011100111101010111010101011100110101110001010110011100101100011011011001111101000101010101011000011101001100000011001110011100100101110111001011011100111101010111010101011101000101100111000101011100101101100001000101011101000101010101011000011101001100000011001110011101000101100111000101011100111101010111010101011100110101110001010110011100101100011011011001111101000101010101011000011101001100000011001110011100100101110111001011001011110 e7abaae8b38ae5b08ae8aab0e9819ce8b38ae7abaae6b8ace58db3e8aab0e9819ce4bb96e7abaae8b38ae5b08ae8aab0e9819ce8b38ae7abaae6b8ace58db3e8aab0e9819ce4bb965e
UHC 竪賊尊誰遜賊竪測?誰遜他竪賊尊誰遜賊竪測?誰遜他^ 1110001010110101111011101110010011110000111011101110001011000001111000011110000111101110111001001110001010110101111101101011010000111111111000101100000111100001111000011111011011100010111000101011010111101110111001001111000011101110111000101100000111100001111000011110111011100100111000101011010111110110101101000011111111100010110000011110000111100001111101101110001001011110 e2b5eee4f0eee2c1e1e1eee4e2b5f6b43fe2c1e1e1f6e2e2b5eee4f0eee2c1e1e1eee4e2b5f6b43fe2c1e1e1f6e25e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
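
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that the standard Python codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charset labels ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), as the table suggests. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("竪^").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.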