What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 湲?臍??????絅??誌攪?絅????? 100111111101000100111111111001000110000000111111001111110011111100111111001111110011111111100011010001000011111100111111100011101000111110011101100110000011111111100011010001000011111100111111001111110011111100111111 9fd13fe4603f3f3f3f3f3fe3443f3f8e8f9d983fe3443f3f3f3f3f
EUC-JP 湲?臍?????邕絅??誌攪?絅????邕 11011110110100110011111111100111110000010011111100111111001111110011111100111111100011111110000111101101111001011010010100111111001111111011101111101111110110011111100000111111111001011010010100111111001111110011111100111111100011111110000111101101 ded33fe7c13f3f3f3f3f8fe1ede5a53f3fbbefd9f83fe5a53f3f3f3f8fe1ed
UTF-8 湲렣臍쇘렩잭렔렡邕絅렧렪誌攪렫絅렩잭렔렡邕 111001101011100110110010111010111010000010100011111010001000011110001101111011001000011110011000111010111010000010101001111011001001111010101101111010111010000010010100111010111010000010100001111010011000001010010101111001111011010110000101111010111010000010100111111010111010000010101010111010001010101010001100111001101001010010101010111010111010000010101011111001111011010110000101111010111010000010101001111011001001111010101101111010111010000010010100111010111010000010100001111010011000001010010101 e6b9b2eba0a3e8878dec8798eba0a9ec9eadeba094eba0a1e98295e7b585eba0a7eba0aae8aa8ce694aaeba0abe7b585eba0a9ec9eadeba094eba0a1e98295
UHC 湲렣臍쇘렩잭렔렡邕絅렧렪誌攪렫絅렩잭렔렡邕 111010101011100010001110101101001111000010110000101111001110011110001110101101111100000011101000100011101010100110001110101100101110100010111011110011001110011110001110101101101000111010111000111100101011110011001110111001101000111010111001110011001110011110001110101101111100000011101000100011101010100110001110101100101110100010111011 eab88eb4f0b0bce78eb7c0e88ea98eb2e8bbcce78eb68eb8f2bccee68eb9cce78eb7c0e88ea98eb2e8bb

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
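
The conversion in the table above can be approximated with a short Python sketch. It is not the code behind this page. The codec names are Python's, and mapping the labels SJIS-WIN and UHC to `cp932` and `cp949` is an assumption, so results for some characters may differ from the table. The helper name `encode_row` is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row shows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    # Print one line per charset for a single sample character.
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row("湲", codec)
        print(label, bits, hex_string)
```

For example, `encode_row("湲", "utf-8")` yields the hex string `e6b9b2`, which matches the first three bytes of the UTF-8 row.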