What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN ツ坦ツ堕アツ達ツ炭W}ツ坦ツ堕アツ達ツ炭W{^ 11000010100100100101001011000010100100011100001010110001110000101001001001000010110000101001001001011001010101110111110111000010100100100101001011000010100100011100001010110001110000101001001001000010110000101001001001011001010101110111101101011110 c29252c291c2b1c29242c29259577dc29252c291c2b1c29242c29259577b5e
EUC-JP ツ坦ツ堕アツ達ツ炭W}ツ坦ツ堕アツ達ツ炭W{^ 1000111011000010110000111011001110001110110000101100001011000100100011101011000110001110110000101100001110100011100011101100001011000011101110100101011101111101100011101100001011000011101100111000111011000010110000101100010010001110101100011000111011000010110000111010001110001110110000101100001110111010010101110111101101011110 8ec2c3b38ec2c2c48eb18ec2c3a38ec2c3ba577d8ec2c3b38ec2c2c48eb18ec2c3a38ec2c3ba577b5e
UTF-8 ツ坦ツ堕アツ達ツ炭W}ツ坦ツ堕アツ達ツ炭W{^ 1110111110111110100000101110010110011101101001101110111110111110100000101110010110100000100101011110111110111101101100011110111110111110100000101110100110000001100101001110111110111110100000101110011110000010101011010101011101111101111011111011111010000010111001011001110110100110111011111011111010000010111001011010000010010101111011111011110110110001111011111011111010000010111010011000000110010100111011111011111010000010111001111000001010101101010101110111101101011110 efbe82e59da6efbe82e5a095efbdb1efbe82e98194efbe82e782ad577defbe82e59da6efbe82e5a095efbdb1efbe82e98194efbe82e782ad577b5e
UHC ?坦????達?炭W}?坦????達?炭W{^ 0011111111110111101001000011111100111111001111110011111111010011101110010011111111110111101010010101011101111101001111111111011110100100001111110011111100111111001111111101001110111001001111111111011110101001010101110111101101011110 3ff7a43f3f3f3fd3b93ff7a9577d3ff7a43f3f3f3fd3b93ff7a9577b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
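
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The mapping from the table's charset labels to Python codec names (in particular `cp932` for SJIS-WIN and `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The labels come from the table; the codec choices are illustrative.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become b"?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # "W{^" is the ASCII tail of the sample input in the table.
    for label, (bits, hex_string) in encode_table("W{^").items():
        print(label, bits, hex_string)
```

For ASCII-only input such as `W{^`, every charset listed produces the same bytes (`577b5e`), which matches the tail of each row in the table. The rows differ only for the non-ASCII characters.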