To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????W????Jn}????W????Jn{^ 00111111001111110011111100111111010101110011111100111111001111110011111101001010011011100111110100111111001111110011111100111111010101110011111100111111001111110011111101001010011011100111101101011110 3f3f3f3f573f3f3f3f4a6e7d3f3f3f3f573f3f3f3f4a6e7b5e
SJIS-WIN 陷エ鄒」W陷エ鄒」Jn}陷エ鄒」W陷エ鄒」Jn{^ 111010001001110010110100111001111011111010100011010101111110100010011100101101001110011110111110101000110100101001101110011111011110100010011100101101001110011110111110101000110101011111101000100111001011010011100111101111101010001101001010011011100111101101011110 e89cb4e7bea357e89cb4e7bea34a6e7de89cb4e7bea357e89cb4e7bea34a6e7b5e
EUC-JP 陷エ鄒」W陷エ鄒」Jn}陷エ鄒」W陷エ鄒」Jn{^ 1110111111111100100011101011010011101110110000001000111010100011010101111110111111111100100011101011010011101110110000001000111010100011010010100110111001111101111011111111110010001110101101001110111011000000100011101010001101010111111011111111110010001110101101001110111011000000100011101010001101001010011011100111101101011110 effc8eb4eec08ea357effc8eb4eec08ea34a6e7deffc8eb4eec08ea357effc8eb4eec08ea34a6e7b5e
UTF-8 陷エ鄒」W陷エ鄒」Jn}陷エ鄒」W陷エ鄒」Jn{^ 111010011001100110110111111011111011110110110100111010011000010010010010111011111011110110100011010101111110100110011001101101111110111110111101101101001110100110000100100100101110111110111101101000110100101001101110011111011110100110011001101101111110111110111101101101001110100110000100100100101110111110111101101000110101011111101001100110011011011111101111101111011011010011101001100001001001001011101111101111011010001101001010011011100111101101011110 e999b7efbdb4e98492efbda357e999b7efbdb4e98492efbda34a6e7de999b7efbdb4e98492efbda357e999b7efbdb4e98492efbda34a6e7b5e
UHC 陷?鄒?W陷?鄒?Jn}陷?鄒?W陷?鄒?Jn{^ 111110011110100000111111111101011101101100111111010101111111100111101000001111111111010111011011001111110100101001101110011111011111100111101000001111111111010111011011001111110101011111111001111010000011111111110101110110110011111101001010011011100111101101011110 f9e83ff5db3f57f9e83ff5db3f4a6e7df9e83ff5db3f57f9e83ff5db3f4a6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)