To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????????????嶸??????る??? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111111010101101000011111100111111001111110011111100111111001111111000001011101001001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3ffab43f3f3f3f3f3f82e93f3f3f
EUC-JP ????????????嶸??????る??? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000111110111011111101000011111100111111001111110011111100111111001111111010010011101011001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f8fbbf43f3f3f3f3f3fa4eb3f3f3f
UTF-8 溜삳젒溜뷴뙜溜뤿졎溜쀫졋嶸앭짍溜뺣졋溜る졋溜찭 111011111010011110001011111011001000001010110011111011001010000010010010111011111010011110001011111010111011011110110100111010111001100110011100111011111010011110001011111010111010010010111111111011001010000110001110111011111010011110001011111011001000000010101011111011001010000110001011111001011011011010111000111011001001010110101101111011001010011110001101111011111010011110001011111010111011101010100011111011001010000110001011111011111010011110001011111000111000001010001011111011001010000110001011111011111010011110001011111011001011000010101101 efa78bec82b3eca092efa78bebb7b4eb999cefa78beba4bfeca18eefa78bec80abeca18be5b6b8ec95adeca78defa78bebbaa3eca18befa78be3828beca18befa78becb0ad
UHC 溜삳젒溜뷴뙜溜뤿졎溜쀫졋嶸앭짍溜뺣졋溜る졋溜찭 11101010111111101011101111101011101000001001000111101010111111101011101011100101100011001010000111101010111111101000111111101011101000001011101111101010111111101001011111101011101000001011101011100111101011101001110111100101101000111001100111101010111111101001010111101011101000001011101011101010111111101010101011101011101000001011101011101010111111101010101001000101 eafebbeba091eafebae58ca1eafe8feba0bbeafe97eba0bae7ae9de5a399eafe95eba0baeafeaaeba0baeafeaa45

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)