To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈺??節??蘖??節ц?厓??殃??敖?? 1111101111000100001111110011111110010000110111110011111100111111100111110101000000111111001111111001000011011111100001001000100000111111111110101000110100111111001111111001111101101001001111110011111110011101110000100011111100111111 fbc43f3f90df3f3f9f503f3f90df84883ffa8d3f3f9f693f3f9dc23f3f
EUC-JP 鈺??節??蘖??節ц?厓??殃??敖?? 10001111111000111101010100111111001111111100000011100001001111110011111111011101101100010011111100111111110000001110000110100111111010000011111110001111101101001100011100111111001111111101110111001010001111110011111111011010110001000011111100111111 8fe3d53f3fc0e13f3fddb13f3fc0e1a7e83f8fb4c73f3fddca3f3fdac43f3f
UTF-8 鈺싧룿節삣똻蘖쀨쾫節ц춾厓곤슥殃곫릹敖쏉슈 1110100110001000101110101110110010001011101001111110101110100011101111111110011110101111100000001110110010000010101000111110101110011000101110111110100010011000100101101110110010000000101010001110110010111110101010111110011110101111100000001101000110000110111011001011011010111110111001011000111010010011111010101011001110100100111011001000101010100101111001101010111010000011111010101011001110101011111010111010011010111001111001101001010110010110111011001000111110001001111011001000101010001000 e988baec8ba7eba3bfe7af80ec82a3eb98bbe89896ec80a8ecbeabe7af80d186ecb6bee58e93eab3a4ec8aa5e6ae83eab3abeba6b9e69596ec8f89ec8a88
UHC 鈺싧룿節삣똻蘖쀨쾫節ц춾厓곤슥殃곫릹敖쏉슈 111010001010110110011010111001011000111110110000111011111011110110111011111001011000110010000001111001011110111010010111111010001011001010000010111011111011110110101100111010001010110110011010111001001110110110110000111011111011110110111011111001001110101010000001111001101001000010010111111001111111100110011011111011111011110110110100 e8ad9ae58fb0efbdbbe58c81e5ee97e8b282efbdace8ad9ae4edb0efbdbbe4ea81e69097e7f99befbdb4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)