To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?る?椅?????嚴щ?油ζ┏遺????? 0011111110000010111010010011111110001000110101100011111100111111001111110011111100111111100110101000111010000100100010110011111110010110111110111000001111000100100001001010110010001000111000100011111100111111001111110011111100111111 3f82e93f88d63f3f3f3f3f9a8e848b3f96fb83c484ac88e23f3f3f3f3f
EUC-JP ?る?椅?????嚴щ?油ζ┏遺????? 0011111110100100111010110011111110110000110110000011111100111111001111110011111100111111110100111110111010100111111010110011111111001100111111011010011011000110101010001010111010110000111001000011111100111111001111110011111100111111 3fa4eb3fb0d83f3f3f3f3fd3eea7eb3fccfda6c6a8aeb0e43f3f3f3f3f
UTF-8 閭る벡椅긺뙴栒뜯뀻嚴щ벤油ζ┏遺용퉾醴븍푻 11101111101001101000011011100011100000101000101111101011101100101010000111100110101001001000010111101010101110001011101011101011100110011011010011100110101000001001001011101011100111001010111111101011100000001011101111100101100110101011010011010001100010011110101110110010101001001110011010110010101110011100111010110110111000101001010010001111111010011000000110111010111011001001101010101001111011011000100110111110111011111010011010110111111010111011100010001101111011011001000110111011 efa686e3828bebb2a1e6a485eab8baeb99b4e6a092eb9cafeb80bbe59ab4d189ebb2a4e6b2b9ceb6e2948fe981baec9aa9ed89beefa6b7ebb88ded91bb
UHC 閭る벡椅긺뙴栒뜯뀻嚴щ벤油ζ┏遺용퉾醴븍푻 111001101010110110101010111010111011101010100100111010111111010110110001111001111000110010110111111000101110001110110110111000101000010110110001111001011111000110101100111010111011101010100101111010101111101010100101111001101010011010101110111010111011011010111111111010111011100110010110111001111110010010111010111010111011111010000111 e6adaaebbaa4ebf5b1e78cb7e2e3b6e285b1e5f1acebbaa5eafaa5e6a6aeebb6bfebb996e7e4baebbe87

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)