To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????n}??????????n{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111110100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN タ蠱アタルタ蠱アタルn}タ蠱アタルタ蠱アタルn{^ 1100000011100101110000011011000111000000110110011100000011100101110000011011000111000000110110010110111001111101110000001110010111000001101100011100000011011001110000001110010111000001101100011100000011011001011011100111101101011110 c0e5c1b1c0d9c0e5c1b1c0d96e7dc0e5c1b1c0d9c0e5c1b1c0d96e7b5e
EUC-JP タ蠱アタルタ蠱アタルn}タ蠱アタルタ蠱アタルn{^ 100011101100000011101010110000111000111010110001100011101100000010001110110110011000111011000000111010101100001110001110101100011000111011000000100011101101100101101110011111011000111011000000111010101100001110001110101100011000111011000000100011101101100110001110110000001110101011000011100011101011000110001110110000001000111011011001011011100111101101011110 8ec0eac38eb18ec08ed98ec0eac38eb18ec08ed96e7d8ec0eac38eb18ec08ed98ec0eac38eb18ec08ed96e7b5e
UTF-8 タ蠱アタルタ蠱アタルn}タ蠱アタルタ蠱アタルn{^ 1110111110111110100000001110100010100000101100011110111110111101101100011110111110111110100000001110111110111110100110011110111110111110100000001110100010100000101100011110111110111101101100011110111110111110100000001110111110111110100110010110111001111101111011111011111010000000111010001010000010110001111011111011110110110001111011111011111010000000111011111011111010011001111011111011111010000000111010001010000010110001111011111011110110110001111011111011111010000000111011111011111010011001011011100111101101011110 efbe80e8a0b1efbdb1efbe80efbe99efbe80e8a0b1efbdb1efbe80efbe996e7defbe80e8a0b1efbdb1efbe80efbe99efbe80e8a0b1efbdb1efbe80efbe996e7b5e
UHC ?蠱????蠱???n}?蠱????蠱???n{^ 0011111111001101110011000011111100111111001111110011111111001101110011000011111100111111001111110110111001111101001111111100110111001100001111110011111100111111001111111100110111001100001111110011111100111111011011100111101101011110 3fcdcc3f3f3f3fcdcc3f3f3f6e7d3fcdcc3f3f3f3fcdcc3f3f3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)