To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??唯??儒??瑤??揖η?柔る┸夭??椅 111000011001111100111111001111111001011101000010001111110011111110001110111100100011111100111111111010101010001000111111001111111001011101001011100000111100010100111111100011110101111110000010111010011000010010111101100110101110111000111111001111111000100011010110 e19f3f3f97423f3f8ef23f3feaa23f3f974b83c53f8f5f82e984bd9aee3f3f88d6
EUC-JP 癲??唯??儒??瑤??揖η?柔る┸夭??椅 111000101010000100111111001111111100110110100011001111110011111110111100111101000011111100111111111101001010010000111111001111111100110110101100101001101100011100111111101111011100000010100100111010111010100010111111110101001111000000111111001111111011000011011000 e2a13f3fcda33f3fbcf43f3ff4a43f3fcdaca6c73fbdc0a4eba8bfd4f03f3fb0d8
UTF-8 癲뗣꺃唯섆풌儒룸왂瑤녠퉫揖η춯柔る┸夭곗떒椅 1110011110011001101100101110101110010111101000111110101010111010100000111110010110010100101011111110110010000100100001101110110110010010100011001110010110000100100100101110101110100011101110001110110010011001100000101110011110010001101001001110101110000101101000001110110110001001101010111110011010001111100101101100111010110111111011001011011010101111111001101001111110010100111000111000001010001011111000101001010010111000111001011010010010101101111010101011001110010111111010111001011010010010111001101010010010000101 e799b2eb97a3eaba83e594afec8486ed928ce58492eba3b8ec9982e791a4eb85a0ed89abe68f96ceb7ecb6afe69f94e3828be294b8e5a4adeab397eb9692e6a485
UHC 癲뗣꺃唯섆풌儒룸왂瑤녠퉫揖η춯柔る┸夭곗떒椅 1110111110100110100010111110001110000011101011001110101011100110100110001110010010111110100100011110101011100011101101111110101110011110101101011110100011111101101100111110101010111001100000111110101111100111101001011110011110101101100011001110101011110101101010101110101110100110101111111110100011101100101100001110110010001011101010001110101111110101 efa68be383aceae698e4be91eae3b7eb9eb5e8fdb3eab983ebe7a5e7ad8ceaf5aaeba6bfe8ecb0ec8ba8ebf5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)