To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??逸??音??永??異??恝揖??喩?? 111000011001111100111111001111111000100011101101001111110011111110001001101110010011111100111111100010010110100100111111001111111000100011011001001111110011111111111010101111001001011101001011001111110011111110011010011001110011111100111111 e19f3f3f88ed3f3f89b93f3f89693f3f88d93f3ffabc974b3f3f9a673f3f
EUC-JP 癲??逸??音??永??異??恝揖??喩?? 11100010101000010011111100111111101100001110111100111111001111111011001010111011001111110011111110110001110010100011111100111111101100001101101100111111001111111000111110111101111001111100110110101100001111110011111111010011110010000011111100111111 e2a13f3fb0ef3f3fb2bb3f3fb1ca3f3fb0db3f3f8fbde7cdac3f3fd3c83f3f
UTF-8 癲뚯눦逸띈쵟音쎌댉永띕뜂異득럩恝揖덄독喩쏆뒾 111001111001100110110010111010111001101010101111111010111000100010100110111010011000000010111000111010111001110110001000111011001011010110011111111010011001111110110011111011001000111010001100111010111000110010001001111001101011000010111000111010111001110110010101111010111001110010000010111001111001010110110000111010111001001110011101111010111001111110101001111001101000000110011101111001101000111110010110111010111000110110000100111010111000111110000101111001011001011010101001111011001000111110000110111010111001001010111110 e799b2eb9aafeb88a6e980b8eb9d88ecb59fe99fb3ec8e8ceb8c89e6b0b8eb9d95eb9c82e795b0eb939deb9fa9e6819de68f96eb8d84eb8f85e596a9ec8f86eb92be
UHC 癲뚯눦逸띈쵟音쎌댉永띕뜂異득럩恝揖덄독喩쏆뒾 1110111110100110100011001110110010000111101111011110110011101111101101101110100010101100101000001110101111100101101111011110110010001000101100101110011110110101101101101110101110001101100001101110110010110110101101011110011010001110100011001100111010111111111010111110011110001000111001111011010110110110111010101110011110011011111011001000101010110100 efa68cec87bdecefb6e8aca0ebe5bdec88b2e7b5b6eb8d86ecb6b5e68e8ccebfebe788e7b5b6eae79bec8ab4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)