To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猥??獄??雅????????濡?????? 1110000011001110001111110011111110001101100101100011111100111111100010011110101100111111001111110011111100111111001111110011111100111111001111111001010001000111001111110011111100111111001111110011111100111111 e0ce3f3f8d963f3f89eb3f3f3f3f3f3f3f3f94473f3f3f3f3f3f
EUC-JP 猥??獄??雅????????濡?????琯 11100000110100000011111100111111101110011111011000111111001111111011001011101101001111110011111100111111001111110011111100111111001111110011111111000111101010000011111100111111001111110011111100111111100011111100110010110011 e0d03f3fb9f63f3fb2ed3f3f3f3f3f3f3f3fc7a83f3f3f3f3f8fccb3
UTF-8 猥띾젻獄띾젿雅먮젗銳띶텩溜뀀젎濡딅젌療귥뼃琯 111001111000110010100101111010111001110110111110111011001010000010111011111001111000110110000100111010111001110110111110111011001010000010111111111010011001101110000101111010111010100010101110111011001010000010010111111010011000101010110011111010111001110110110110111011011000010110101001111011111010011110001011111010111000000010000000111011001010000010001110111001101011111110100001111010111001010010000101111011001010000010001100111011111010011110000001111010101011011110100101111010111011110010000011111001111001000010101111 e78ca5eb9dbeeca0bbe78d84eb9dbeeca0bfe99b85eba8aeeca097e98ab3eb9db6ed85a9efa78beb8080eca08ee6bfa1eb9485eca08cefa781eab7a5ebbc83e790af
UHC 猥띾젻獄띾젿雅먮젗銳띶텩溜뀀젎濡딅젌療귥뼃琯 1110100011100101100011011110101110100000101011101110100010101011100011011110101110100000101100011110010010111010100100001110101110100000100100111110011111100101100011011110010110110110100111011110101011111110101100101110101110100000100011111110101110100001100010101110101110100000100011011110100011111110100000101110110010010110100011011100111010110101 e8e58deba0aee8ab8deba0b1e4ba90eba093e7e58de5b69deafeb2eba08feba18aeba08de8fe82ec968dceb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)