To which bit string is a character (or string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????Vn}????????Vn{^ 0011111100111111001111110011111100111111001111110011111100111111010101100110111001111101001111110011111100111111001111110011111100111111001111110011111101010110011011100111101101011110 3f3f3f3f3f3f3f3f566e7d3f3f3f3f3f3f3f3f566e7b5e
SJIS-WIN テョテ、テ。テ凡Vn}テョテ、テ。テ凡Vn{^ 11000011101011101100001110100100110000111010000111000011100101100111110101010110011011100111110111000011101011101100001110100100110000111010000111000011100101100111110101010110011011100111101101011110 c3aec3a4c3a1c3967d566e7dc3aec3a4c3a1c3967d566e7b5e
EUC-JP テョテ、テ。テ凡Vn}テョテ、テ。テ凡Vn{^ 100011101100001110001110101011101000111011000011100011101010010010001110110000111000111010100001100011101100001111001011110111100101011001101110011111011000111011000011100011101010111010001110110000111000111010100100100011101100001110001110101000011000111011000011110010111101111001010110011011100111101101011110 8ec38eae8ec38ea48ec38ea18ec3cbde566e7d8ec38eae8ec38ea48ec38ea18ec3cbde566e7b5e
UTF-8 テョテ、テ。テ凡Vn}テョテ、テ。テ凡Vn{^ 11101111101111101000001111101111101111011010111011101111101111101000001111101111101111011010010011101111101111101000001111101111101111011010000111101111101111101000001111100101100001111010000101010110011011100111110111101111101111101000001111101111101111011010111011101111101111101000001111101111101111011010010011101111101111101000001111101111101111011010000111101111101111101000001111100101100001111010000101010110011011100111101101011110 efbe83efbdaeefbe83efbda4efbe83efbda1efbe83e587a1566e7defbe83efbdaeefbe83efbda4efbe83efbda1efbe83e587a1566e7b5e
UHC ???????凡Vn}???????凡Vn{^ 00111111001111110011111100111111001111110011111100111111110110111110110101010110011011100111110100111111001111110011111100111111001111110011111100111111110110111110110101010110011011100111101101011110 3f3f3f3f3f3f3fdbed566e7d3f3f3f3f3f3f3fdbed566e7b5e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
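
The conversion shown in the table can be approximated with a short Python sketch. It is not the tool's own implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions chosen for illustration, and Python's codecs may differ from the tool's for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: closest Python equivalent of SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in encode_table("Vn{^").items():
        print(label, binary, hexa)
```

Pure-ASCII input such as `Vn{^` yields the same bytes (`566e7b5e`) in all five charsets, which is why every row of the table ends with the same hex digits.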