Which bit string does a character (or several characters) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌???ο???????徇??矣??夜???? 11100010101000110011111100111111001111111000001111001101001111110011111100111111001111110011111100111111001111111001110001101101001111110011111111100001111000010011111100111111100101101110100100111111001111110011111100111111 e2a33f3f3f83cd3f3f3f3f3f3f3f9c6d3f3fe1e13f3f96e93f3f3f3f
EUC-JP 筌?ł?ο????孼??徇??矣??夜??庾? 11100100101001010011111110001111101010011100100000111111101001101100111100111111001111110011111100111111100011111011101011000011001111110011111111010111110011100011111100111111111000101110001100111111001111111100110011101011001111110011111110001111101111001100111000111111 e4a53f8fa9c83fa6cf3f3f3f3f8fbac33f3fd7ce3f3fe2e33f3fcceb3f3f8fbcce3f
UTF-8 筌뤿ł履ο쭗硫깅펳孼뽮쑬徇섊뮫矣뚣걯夜쏅쪇庾퀲 11100111101011011000110011101011101001001011111111000101100000101110111110100111100111111100111010111111111011001010110110010111111011111010011110001110111010101011100110000101111011011000111010110011111001011010110110111100111010111011110110101110111011001001000110101100111001011011111010000111111011001000010010001010111010111010111010101011111001111001111110100011111010111001101010100011111010101011000110101111111001011010010010011100111011001000111110000101111011001010101010000111111001011011101010111110111011011000000010110010 e7ad8ceba4bfc582efa79fcebfecad97efa78eeab985ed8eb3e5adbcebbdaeec91ace5be87ec848aebaeabe79fa3eb9aa3eab1afe5a49cec8f85ecaa87e5babeed80b2
UHC 筌뤿ł履ο쭗硫깅펳孼뽮쑬徇섊뮫矣뚣걯夜쏅쪇庾퀲 11101111101001111000111111101011101010011010100111101100101010101010010111101111101001111000111111101011101010011011000111101011101111001000010111100101111011011001011011101010101111101010100011100010110111111001100011100111100100101011010111101011111110001000110011100011100000011001100011100101101010001001101111101011101001011000000111101010111011001011010001000101 efa78feba9a9ecaaa5efa78feba9b1ebbc85e5ed96eabea8e2df98e792b5ebf88ce38198e5a89beba581eaecb445

SJIS-win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
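
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names are assumptions (in particular, Python's `cp932` and `cp949` codecs are assumed to be close equivalents of SJIS-win and UHC, and may differ from this page's converter for some characters). Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),   # assumption: cp932 stands in for SJIS-win
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # assumption: cp949 stands in for UHC
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes '?' for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("あ"):
        print(label, binary, hexadecimal)
```

Running the script prints one line per charset for the input character, in the same column order as the table (without the re-decoded "Character" column).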