What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畑??議??濡?ぁ???壹??儒??筌 1001010010101000001111110011111110001011011000110011111100111111100101000100011100111111100000101001111100111111001111110011111110011010111000110011111100111111100011101111001000111111001111111110001010100011 94a83f3f8b633f3f94473f829f3f3f3f9ae33f3f8ef23f3fe2a3
EUC-JP 畑??議??濡?ぁ???壹?ł儒?Ŧ筌 110010001010101000111111001111111011010111000100001111110011111111000111101010000011111110100100101000010011111100111111001111111101010011100101001111111000111110101001110010001011110011110100001111111000111110101001101011111110010010100101 c8aa3f3fb5c43f3fc7a83fa4a13f3f3fd4e53f8fa9c8bcf43f8fa9afe4a5
UTF-8 畑밸뗀議곲턁濡녹ぁ捻뀀슖壹븀ł儒우Ŧ筌 11100111100101011001000111101011101100001011100011101011100101111000000011101000101011011011000011101010101100111011001011101101100001001000000111100110101111111010000111101011100001011011100111100011100000011000000111101111101001101010010011101011100000001000000011101100100010101001011011100101101000111011100111101011101110001000000011000101100000101110010110000100100100101110110010011010101100001100010110100110111001111010110110001100 e79591ebb0b8eb9780e8adb0eab3b2ed8481e6bfa1eb85b9e38181efa6a4eb8080ec8a96e5a3b9ebb880c582e58492ec9ab0c5a6e7ad8c
UHC 畑밸뗀議곲턁濡녹ぁ捻뀀슖壹븀ł儒우Ŧ筌 1110111110100101101110011110101110110110101111101110110010100001100000011110100110110101100111011110101110100001101100111110110010101010101000011110011011110111101100101110101110011010101001011110110011101100101110101110011110101001101010011110101011100011101111111110110010101000101011101110111110100111 efa5b9ebb6beeca181e9b59deba1b3ecaaa1e6f7b2eb9aa5ececbae7a9a9eae3bfeca8aeefa7

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
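
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_report` is hypothetical, the codec names are Python's own (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC), and it assumes that characters a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row above suggests.

```python
# Hypothetical helper: encode a string in several charsets and show the
# resulting bytes as a binary string and as a hexadecimal string.
# Python codec names are assumed to correspond to the charsets in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return a list of (charset label, bit string, hex string) tuples."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_report("畑"):
        print(label, bits, hex_string)
```

For the single character 畑, this sketch yields 3f for ISO-8859-1, 94a8 for SJIS-WIN, c8aa for EUC-JP, and e79591 for UTF-8, which agrees with the leading bytes of the corresponding rows in the table.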