To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??逸??惟??鼇??違?ゥ怨??畑 111000111010000000111111001111111000100011101101001111110011111110001000110100100011111100111111111010101000011100111111001111111000100011100001001111111000001101000100100010011000010100111111001111111001010010101000 e3a03f3f88ed3f3f88d23f3fea873f3f88e13f834489853f3f94a8
EUC-JP 罌??逸??惟??鼇??違?ゥ怨??畑 111001101010001000111111001111111011000011101111001111110011111110110000110101000011111100111111111100111110011100111111001111111011000011100011001111111010010110100101101100011110010100111111001111111100100010101010 e6a23f3fb0ef3f3fb0d43f3ff3e73f3fb0e33fa5a5b1e53f3fc8aa
UTF-8 罌삳냲逸썲컜惟듭뒳鼇앷퍛違쇤ゥ怨븍눜畑 111001111011110110001100111011001000001010110011111010111000001110110010111010011000000010111000111011001000110110110010111011001011101110011100111001101000001110011111111010111001001110101101111010111001001010110011111010011011110010000111111011001001010110110111111011011000110110011011111010011000000110010101111011001000011110100100111000111000001010100101111001101000000010101000111010111011100010001101111010111000100010011100111001111001010110010001 e7bd8cec82b3eb83b2e980b8ec8db2ecbb9ce6839feb93adeb92b3e9bc87ec95b7ed8d9be98195ec87a4e382a5e680a8ebb88deb889ce79591
UHC 罌삳냲逸썲컜惟듭뒳鼇앷퍛違쇤ゥ怨븍눜畑 1110010110100010101110111110101110000110100000101110110011101111101111011110010110110000100001111110101011101110101101011110110010001010101011001110100010101000100111011110101010111011100100101110101011011110101111001110100110101011101001011110101010110011101110101110101110000111101101001110111110100101 e5a2bbeb8682ecefbde5b087eaeeb5ec8aace8a89deabb92eadebce9aba5eab3baeb87b4efa5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)