To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 竪竪孫誰遜属竪賊測誰遜捉竪竪孫誰遜属竪賊測誰遜捉B 10010010010001111001001001000111100100011011011110010010010011101001000110111011100100011010111010010010010001111001000110101111100100011010101010010010010011101001000110111011100100011010100010010010010001111001001001000111100100011011011110010010010011101001000110111011100100011010111010010010010001111001000110101111100100011010101010010010010011101001000110111011100100011010100001000010 9247924791b7924e91bb91ae924791af91aa924e91bb91a89247924791b7924e91bb91ae924791af91aa924e91bb91a842
EUC-JP 竪竪孫誰遜属竪賊測誰遜捉竪竪孫誰遜属竪賊測誰遜捉B 11000011101010001100001110101000110000101011100111000011101011111100001010111101110000101011000011000011101010001100001010110001110000101010110011000011101011111100001010111101110000101010101011000011101010001100001110101000110000101011100111000011101011111100001010111101110000101011000011000011101010001100001010110001110000101010110011000011101011111100001010111101110000101010101001000010 c3a8c3a8c2b9c3afc2bdc2b0c3a8c2b1c2acc3afc2bdc2aac3a8c3a8c2b9c3afc2bdc2b0c3a8c2b1c2acc3afc2bdc2aa42
UTF-8 竪竪孫誰遜属竪賊測誰遜捉竪竪孫誰遜属竪賊測誰遜捉B 11100111101010111010101011100111101010111010101011100101101011011010101111101000101010101011000011101001100000011001110011100101101100011001111011100111101010111010101011101000101100111000101011100110101110001010110011101000101010101011000011101001100000011001110011100110100011011000100111100111101010111010101011100111101010111010101011100101101011011010101111101000101010101011000011101001100000011001110011100101101100011001111011100111101010111010101011101000101100111000101011100110101110001010110011101000101010101011000011101001100000011001110011100110100011011000100101000010 e7abaae7abaae5adabe8aab0e9819ce5b19ee7abaae8b38ae6b8ace8aab0e9819ce68d89e7abaae7abaae5adabe8aab0e9819ce5b19ee7abaae8b38ae6b8ace8aab0e9819ce68d8942
UHC 竪竪孫誰遜?竪賊測誰遜捉竪竪孫誰遜?竪賊測誰遜捉B 1110001010110101111000101011010111100001110111011110001011000001111000011110000100111111111000101011010111101110111001001111011010110100111000101100000111100001111000011111001110110101111000101011010111100010101101011110000111011101111000101100000111100001111000010011111111100010101101011110111011100100111101101011010011100010110000011110000111100001111100111011010101000010 e2b5e2b5e1dde2c1e1e13fe2b5eee4f6b4e2c1e1e1f3b5e2b5e2b5e1dde2c1e1e13fe2b5eee4f6b4e2c1e1e1f3b542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)