To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????h 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN ????????柔???????????h 00111111001111110011111100111111001111110011111100111111001111111000111101011111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f8f5f3f3f3f3f3f3f3f3f3f3f3f68
EUC-JP ????????柔???????????h 00111111001111110011111100111111001111110011111100111111001111111011110111000000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3fbdc03f3f3f3f3f3f3f3f3f3f3f68
UTF-8 溜삘뵗溜멊溜삠뀛柔섃뵗溜뺣졎栒붴뵔栒붾젘h 11101111101001111000101111101100100000101001100011101011101101011001011111101111101001111000101111101011101010011000101011101111101001111000101111101100100000101010000011101011100000001001101111100110100111111001010011101100100001001000001111101011101101011001011111101111101001111000101111101011101110101010001111101100101000011000111011100110101000001001001011101011101101101011010011101011101101011001010011100110101000001001001011101011101101101011111011101100101000001001100001101000 efa78bec8298ebb597efa78beba98aefa78bec82a0eb809be69f94ec8483ebb597efa78bebbaa3eca18ee6a092ebb6b4ebb594e6a092ebb6beeca09868
UHC 溜삘뵗溜멊溜삠뀛柔섃뵗溜뺣졎栒붴뵔栒붾젘h 1110101011111110101110111110001010010100100110011110101011111110100100010100001011101010111111101011101111100011100001011001010011101010111101011001100011100010100101001001100111101010111111101001010111101011101000001011101111100010111000111001010011100010100101001001011011100010111000111001010011101011101000001001010001101000 eafebbe29499eafe9142eafebbe38594eaf598e29499eafe95eba0bbe2e394e29496e2e394eba09468

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)