To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 寤μ??η????應?????循よ???? 100110111000100010000011110010100011111100111111100000111100010100111111001111110011111100111111100111001110010000111111001111110011111100111111001111111000111101111010100000101110011000111111001111110011111100111111 9b8883ca3f3f83c53f3f3f3f9ce43f3f3f3f3f8f7a82e63f3f3f3f
EUC-JP 寤μ??η????應?????循よ?洧?? 1101010111101000101001101100110000111111001111111010011011000111001111110011111100111111001111111101100011100110001111110011111100111111001111110011111110111101110110111010010011101000001111111000111111000111101101000011111100111111 d5e8a6cc3f3fa6c73f3f3f3fd8e63f3f3f3f3fbddba4e83f8fc7b43f3f
UTF-8 寤μ뜾杻η넭類배뿥應몄뵁凉깅봺循よ삏洧띠떼 11100101101011111010010011001110101111001110101110011100101111101110111110100111100010001100111010110111111010111000010010101101111011111010011110010000111010111011000010110000111010111011111110100101111001101000011110001001111010111010101010000100111010111011010110000001111011111010010110111001111010101011100110000101111010111011010010111010111001011011111010101010111000111000001010001000111011001000001010001111111001101011010010100111111010111001110110100000111010111001011010111100 e5afa4cebceb9cbeefa788ceb7eb84adefa790ebb0b0ebbfa5e68789ebaa84ebb581efa5b9eab985ebb4bae5beaae38288ec828fe6b4a7eb9da0eb96bc
UHC 寤μ뜾杻η넭類배뿥應몄뵁凉깅봺循よ삏洧띠떼 111001111111010110100101111011001000110110111001111010101111010010100101111001111000011010101100111010111011101010111001111010001001011110100101111010111110101110111000111011001001010010000111111001011011110010110001111010111001010010000001111000101110000010101010111010001001100010010110111010101111101110110110111011001011011010111100 e7f5a5ec8db9eaf4a5e786acebbab9e897a5ebebb8ec9487e5bcb1eb9481e2e0aae89896eafbb6ecb6bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)