To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?踰??音?????游??濡???? 1110000110011111100000111000101100111111111001101111101000111111001111111000100110111001001111110011111100111111001111110011111110011111111000000011111100111111100101000100011100111111001111110011111100111111 e19f838b3fe6fa3f3f89b93f3f3f3f3f9fe03f3f94473f3f3f3f
EUC-JP 癲ル?踰??音?????游??濡???? 1110001010100001101001011110101100111111111011001111110000111111001111111011001010111011001111110011111100111111001111110011111111011110111000100011111100111111110001111010100000111111001111110011111100111111 e2a1a5eb3fecfc3f3fb2bb3f3f3f3f3fdee23f3fc7a83f3f3f3f
UTF-8 癲ル슢踰됪룚音좉턀呂얠뜫游룬뼸濡덉눛留쵢 111001111001100110110010111000111000001110101011111011001000101010100010111010001011100010110000111010111001000010101010111010111010001110011010111010011001111110110011111011001010001010001001111011011000010010000000111011111010011010000000111011001001011010100000111010111001110010101011111001101011100010111000111010111010001110101100111010111011110010111000111001101011111110100001111010111000110110001001111010111000100010011011111011111010011110001101111011001011010110100010 e799b2e383abec8aa2e8b8b0eb90aaeba39ae99fb3eca289ed8480efa680ec96a0eb9cabe6b8b8eba3acebbcb8e6bfa1eb8d89eb889befa78decb5a2
UHC 癲ル슢踰됪룚音좉턀呂얠뜫游룬뼸濡덉눛留쵢 11101111101001101010101111101011100110101010111011101011101100101000100111100110100011111001011011101011111001011010000011101010101101011001110011100101111110111011111011101100100011011010110011101010111111011011011111101001100101101011101111101011101000011000100011101100100001111011001111101011101001111010110101000010 efa6abeb9aaeebb289e68f96ebe5a0eab59ce5fbbeec8daceafdb7e996bbeba188ec87b3eba7ad42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)