What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 渦????Ⅹ幼??永??渦????Ⅹ幼??永??B 100010010101000100111111001111110011111100111111100001110101110110010111011000110011111100111111100010010110100100111111001111111000100101010001001111110011111100111111001111111000011101011101100101110110001100111111001111111000100101101001001111110011111101000010 89513f3f3f3f875d97633f3f89693f3f89513f3f3f3f875d97633f3f89693f3f42
EUC-JP 渦?????幼??永??渦?????幼??永??B 10110001101100100011111100111111001111110011111100111111110011011100010000111111001111111011000111001010001111110011111110110001101100100011111100111111001111110011111100111111110011011100010000111111001111111011000111001010001111110011111101000010 b1b23f3f3f3f3fcdc43f3fb1ca3f3fb1b23f3f3f3f3fcdc43f3fb1ca3f3f42
UTF-8 渦깆엺栒곻Ⅹ幼먰뭴永띠뿝渦깆엺栒곻Ⅹ幼먰뭴永띠뿝B 11100110101110001010011011101010101110011000011011101100100101111011101011100110101000001001001011101010101100111011101111100010100001011010100111100101101110011011110011101011101010001011000011101011101011011011010011100110101100001011100011101011100111011010000011101011101111111001110111100110101110001010011011101010101110011000011011101100100101111011101011100110101000001001001011101010101100111011101111100010100001011010100111100101101110011011110011101011101010001011000011101011101011011011010011100110101100001011100011101011100111011010000011101011101111111001110101000010 e6b8a6eab986ec97bae6a092eab3bbe285a9e5b9bceba8b0ebadb4e6b0b8eb9da0ebbf9de6b8a6eab986ec97bae6a092eab3bbe285a9e5b9bceba8b0ebadb4e6b0b8eb9da0ebbf9d42
UHC 渦깆엺栒곻Ⅹ幼먰뭴永띠뿝渦깆엺栒곻Ⅹ幼먰뭴永띠뿝B 11101000101111101011000111101100100111101000110011100010111000111000000111101111101001011011100111101010111010101001000011101101100100101000001111100111101101011011011011101100100101111001111111101000101111101011000111101100100111101000110011100010111000111000000111101111101001011011100111101010111010101001000011101101100100101000001111100111101101011011011011101100100101111001111101000010 e8beb1ec9e8ce2e381efa5b9eaea90ed9283e7b5b6ec979fe8beb1ec9e8ce2e381efa5b9eaea90ed9283e7b5b6ec979f42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
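
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents of the charset labels above, and the helper names `to_bits` and `encode_rows` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f) via `errors="replace"`, which is consistent with the runs of `3f` in the table.

```python
# Charset labels from the table, mapped to Python codec names.
# These mappings are assumptions; the page's own converter may differ in edge cases.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Return the bytes as one continuous string of 0s and 1s (8 bits per byte)."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_rows(text: str):
    """Encode text in every charset and return (label, binary, hex) tuples."""
    rows = []
    for label, codec in CODECS.items():
        # Unencodable characters become "?" (0x3f) instead of raising an error.
        data = text.encode(codec, errors="replace")
        rows.append((label, to_bits(data), data.hex()))
    return rows


if __name__ == "__main__":
    # Print only the ASCII columns (label, binary, hex) for a sample input.
    for label, bits, hex_string in encode_rows("\u6e26B"):
        print(label, bits, hex_string)
```

Calling `encode_rows` with any short string yields one row per charset, in the same order as the table above. The binary column is simply the hexadecimal column written out at 8 bits per byte.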