To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?罕幀悠????宏?罕幀悠????槐^ 001111111110001110100101100110111110101010010111010010010011111100111111001111110011111110001101010001110011111111100011101001011001101111101010100101110100100100111111001111110011111100111111100111101100010101011110 3fe3a59bea97493f3f3f3f8d473fe3a59bea97493f3f3f3f9ec55e
EUC-JP ?罕幀悠???珌宏?罕幀悠???珌槐^ 00111111111001101010011111010110111011001100110110101010001111110011111100111111100011111100101111101100101110011010100000111111111001101010011111010110111011001100110110101010001111110011111100111111100011111100101111101100110111001100011101011110 3fe6a7d6eccdaa3f3f3f8fcbecb9a83fe6a7d6eccdaa3f3f3f8fcbecdcc75e
UTF-8 뤋罕幀悠샘렒뤋珌宏뤋罕幀悠샘렒뤋珌槐^ 11101011101001001000101111100111101111011001010111100101101110011000000011100110100000101010000011101100100000111001100011101011101000001001001011101011101001001000101111100111100011111000110011100101101011101000111111101011101001001000101111100111101111011001010111100101101110011000000011100110100000101010000011101100100000111001100011101011101000001001001011101011101001001000101111100111100011111000110011100110101001111001000001011110 eba48be7bd95e5b980e682a0ec8398eba092eba48be78f8ce5ae8feba48be7bd95e5b980e682a0ec8398eba092eba48be78f8ce6a7905e
UHC 뤋罕幀悠샘렒뤋珌宏뤋罕幀悠샘렒뤋珌槐^ 10001111101110111111100111010110111011111101001111101010111011011011101111111001100011101010011110001111101110111111100110110011110011101101101110001111101110111111100111010110111011111101001111101010111011011011101111111001100011101010011110001111101110111111100110110011110011101101100101011110 8fbbf9d6efd3eaedbbf98ea78fbbf9b3cedb8fbbf9d6efd3eaedbbf98ea78fbbf9b3ced95e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
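
The same kind of conversion can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function name `encode_report` is hypothetical, and it assumes Python's codec names `cp932` for SJIS-Win and `cp949` for UHC. Characters that a charset cannot represent are replaced with `?` (0x3f), which is one plausible source of the runs of `3f` in the table above.

```python
def encode_report(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    Assumed codec mapping: cp932 = SJIS-Win, cp949 = UHC.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        data = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexed in encode_report("あ^"):
        print(name, bits, hexed)
```

An ASCII character such as `^` comes out as the single byte `5e` in all five charsets, which is why every row of the table ends in `5e`.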