To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????P 0011111100111111001111110011111100111111001111110011111101010000 3f3f3f3f3f3f3f50
SJIS-WIN テ榲ウテイテャP 110000111001111011000011101100111100001110110010110000111010110001010000 c39ec3b3c3b2c3ac50
EUC-JP テ榲ウテイテャP 100011101100001111011100110001011000111010110011100011101100001110001110101100101000111011000011100011101010110001010000 8ec3dcc58eb38ec38eb28ec38eac50
UTF-8 テ榲ウテイテャP 11101111101111101000001111100110101001101011001011101111101111011011001111101111101111101000001111101111101111011011001011101111101111101000001111101111101111011010110001010000 efbe83e6a6b2efbdb3efbe83efbdb2efbe83efbdac50
UHC ???????P 0011111100111111001111110011111100111111001111110011111101010000 3f3f3f3f3f3f3f50

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)