Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 諤ィ?讌諧讎奓・N}諤ィ?讌諧讎奓・N{^ 11100110100000001010100000111111111001101010010111100110011111101110011010100110111110101010000010100101010011100111110111100110100000001010100000111111111001101010010111100110011111101110011010100110111110101010000010100101010011100111101101011110 e680a83fe6a5e67ee6a6faa0a54e7de680a83fe6a5e67ee6a6faa0a54e7b5e
EUC-JP 諤ィ奒讌諧讎奓・N}諤ィ奒讌諧讎奓・N{^ 1110101111100000100011101010100010001111101110001111010011101100101001111110101111011111111011001010100010001111101110001111010110001110101001010100111001111101111010111110000010001110101010001000111110111000111101001110110010100111111010111101111111101100101010001000111110111000111101011000111010100101010011100111101101011110 ebe08ea88fb8f4eca7ebdfeca88fb8f58ea54e7debe08ea88fb8f4eca7ebdfeca88fb8f58ea54e7b5e
UTF-8 諤ィ奒讌諧讎奓・N}諤ィ奒讌諧讎奓・N{^ 1110100010101011101001001110111110111101101010001110010110100101100100101110100010101110100011001110100010101011101001111110100010101110100011101110010110100101100100111110111110111101101001010100111001111101111010001010101110100100111011111011110110101000111001011010010110010010111010001010111010001100111010001010101110100111111010001010111010001110111001011010010110010011111011111011110110100101010011100111101101011110 e8aba4efbda8e5a592e8ae8ce8aba7e8ae8ee5a593efbda54e7de8aba4efbda8e5a592e8ae8ce8aba7e8ae8ee5a593efbda54e7b5e
UHC ????諧???N}????諧???N{^ 0011111100111111001111110011111111111010101100100011111100111111001111110100111001111101001111110011111100111111001111111111101010110010001111110011111100111111010011100111101101011110 3f3f3f3ffab23f3f3f4e7d3f3f3f3ffab23f3f3f4e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
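
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: it assumes Python's built-in codecs are close enough stand-ins for the charsets listed (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC), and the names CODECS, encode_row and convert are hypothetical helpers introduced only for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Charset labels as shown in the table, mapped to Python codec names.
# This mapping is an assumption: cp932 stands in for SJIS-WIN, cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (byte 0x3f).
    data = text.encode(codec, errors="replace")
    # Render every byte as exactly eight binary digits, with no separators.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for label, (bits, hexstr) in convert("あ").items():
        print(label, bits, hexstr)
```

For example, convert("あ")["UTF-8"] yields the hexadecimal string e38182, while the ISO-8859-1 entry is 3f because that charset has no Japanese characters. Results for vendor-specific characters may differ from the tool's output, since the codec mapping above is only an approximation.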