What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???猷??儀??藥?????韋?????爾? 00111111001111110011111110010111010100010011111100111111100010110101011000111111001111111110010101011010001111110011111100111111001111110011111111101000111010000011111100111111001111110011111100111111100011101010001000111111 3f3f3f97513f3f8b563f3fe55a3f3f3f3f3fe8e83f3f3f3f3f8ea23f
EUC-JP ???猷??儀??藥?????韋?????爾? 00111111001111110011111111001101101100100011111100111111101101011011011100111111001111111110100110111011001111110011111100111111001111110011111111110000111010100011111100111111001111110011111100111111101111001010010000111111 3f3f3fcdb23f3fb5b73f3fe9bb3f3f3f3f3ff0ea3f3f3f3f3fbca43f
UTF-8 歷띰퐣猷녽뇳儀숈춶藥꿸퍏流쒒레韋몃턀嶺뚮봿爾좪 111011111010011010001100111010111001110110110000111011011001000010100011111001111000110010110111111010111000010110111101111010111000011110110011111001011000010010000000111011001000100010001000111011001011011010110110111010001001011110100101111010101011111110111000111011011000110110001111111011111010011110001010111011001001001010010010111010111010000010001000111010011001111110001011111010111010101010000011111011011000010010000000111011111010011010101011111010111001101010101110111010111011010010111111111001111000100010111110111011001010001010101010 efa68ceb9db0ed90a3e78cb7eb85bdeb87b3e58480ec8888ecb6b6e897a5eabfb8ed8d8fefa78aec9292eba088e99f8bebaa83ed8480efa6abeb9aaeebb4bfe788beeca2aa
UHC 歷띰퐣猷녽뇳儀숈춶藥꿸퍏流쒒레韋몃턀嶺뚮봿爾좪 11100110101110001011011011101111101111011000110011101011101000111000011011101001100001111001011111101011111100001001100111101100101011011001001011100101101101111011001011101010101110111000011011101010111111001001110011101001101101111011100111101010110111111011100011101011101101011001110011100111101011011000110011101011100101001000011011101100101100111010000101000101 e6b8b6efbd8ceba386e98797ebf099ecad92e5b7b2eabb86eafc9ce9b7b9eadfb8ebb59ce7ad8ceb9486ecb3a145

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f).
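
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codecs may differ from this tool in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f).

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

For the input "あ", the ISO-8859-1 row comes out as 00111111 / 3f because Latin-1 has no such character, while UTF-8 yields the three bytes e38182.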