To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 如??吟??渦??[如??吟??渦??[^ 100101000100000000111111001111111000101111100001001111110011111110001001010100010011111100111111010110111001010001000000001111110011111110001011111000010011111100111111100010010101000100111111001111110101101101011110 94403f3f8be13f3f89513f3f5b94403f3f8be13f3f89513f3f5b5e
EUC-JP 如??吟??渦??[如??吟??渦??[^ 110001111010000100111111001111111011011011100011001111110011111110110001101100100011111100111111010110111100011110100001001111110011111110110110111000110011111100111111101100011011001000111111001111110101101101011110 c7a13f3fb6e33f3fb1b23f3f5bc7a13f3fb6e33f3fb1b23f3f5b5e
UTF-8 如볢쓩吟쀦윋渦깆깤[如볢쓩吟쀦윋渦깆깤[^ 111001011010011010000010111010111011001110100010111011001001001110101001111001011001000010011111111011001000000010100110111011001001110010001011111001101011100010100110111010101011100110000110111010101011100110100100010110111110010110100110100000101110101110110011101000101110110010010011101010011110010110010000100111111110110010000000101001101110110010011100100010111110011010111000101001101110101010111001100001101110101010111001101001000101101101011110 e5a682ebb3a2ec93a9e5909fec80a6ec9c8be6b8a6eab986eab9a45be5a682ebb3a2ec93a9e5909fec80a6ec9c8be6b8a6eab986eab9a45b5e
UHC 如볢쓩吟쀦윋渦깆깤[如볢쓩吟쀦윋渦깆깤[^ 111001011111110110010011111010001011111010110001111010111110000110010111111001101001111110010011111010001011111010110001111011001000001110010111010110111110010111111101100100111110100010111110101100011110101111100001100101111110011010011111100100111110100010111110101100011110110010000011100101110101101101011110 e5fd93e8beb1ebe197e69f93e8beb1ec83975be5fd93e8beb1ebe197e69f93e8beb1ec83975b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)