What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 畏??瓦??猥??穩??畏??瓦??猥??穩??^ 100010001101100000111111001111111000101010100010001111110011111111100000110011100011111100111111111000100111001000111111001111111000100011011000001111110011111110001010101000100011111100111111111000001100111000111111001111111110001001110010001111110011111101011110 88d83f3f8aa23f3fe0ce3f3fe2723f3f88d83f3f8aa23f3fe0ce3f3fe2723f3f5e
EUC-JP 畏??瓦??猥??穩??畏??瓦??猥??穩??^ 101100001101101000111111001111111011010010100100001111110011111111100000110100000011111100111111111000111101001100111111001111111011000011011010001111110011111110110100101001000011111100111111111000001101000000111111001111111110001111010011001111110011111101011110 b0da3f3fb4a43f3fe0d03f3fe3d33f3fb0da3f3fb4a43f3fe0d03f3fe3d33f3f5e
UTF-8 畏묕쉠瓦븝슬猥듸슁穩롧퉼畏묕쉠瓦븝슬猥듸슁穩롧퉼^ 11100111100101011000111111101011101011001001010111101100100010011010000011100111100100111010011011101011101110001001110111101100100010101010110011100111100011001010010111101011100100111011100011101100100010101000000111100111101010011010100111101011101000011010011111101101100010011011110011100111100101011000111111101011101011001001010111101100100010011010000011100111100100111010011011101011101110001001110111101100100010101010110011100111100011001010010111101011100100111011100011101100100010101000000111100111101010011010100111101011101000011010011111101101100010011011110001011110 e7958febac95ec89a0e793a6ebb89dec8aace78ca5eb93b8ec8a81e7a9a9eba1a7ed89bce7958febac95ec89a0e793a6ebb89dec8aace78ca5eb93b8ec8a81e7a9a9eba1a7ed89bc5e
UHC 畏묕쉠瓦븝슬猥듸슁穩롧퉼畏묕쉠瓦븝슬猥듸슁穩롧퉼^ 11101000111001101001000111101111101111011010101011101000101111111011101011101111101111011011110111101000111001011011010111101111101111011011001111101000101100011000111011100111101110011001010011101000111001101001000111101111101111011010101011101000101111111011101011101111101111011011110111101000111001011011010111101111101111011011001111101000101100011000111011100111101110011001010001011110 e8e691efbdaae8bfbaefbdbde8e5b5efbdb3e8b18ee7b994e8e691efbdaae8bfbaefbdbde8e5b5efbdb3e8b18ee7b9945e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
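
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (CP932 for SJIS-Win, CP949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f seen in the table.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def to_bit_strings(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in to_bit_strings("畏^").items():
        print(label, binary, hexadecimal)
```

For the input "畏^", the hexadecimal column should agree with the leading and trailing bytes of the table rows: 88d8 and 5e for SJIS-Win, b0da and 5e for EUC-JP, e7958f and 5e for UTF-8.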