To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 渦???ц?蘊??v渦???ц?蘊??vB 100010010101000100111111001111110011111110000100100010000011111111100101010111010011111100111111011101101000100101010001001111110011111100111111100001001000100000111111111001010101110100111111001111110111011001000010 89513f3f3f84883fe55d3f3f7689513f3f3f84883fe55d3f3f7642
EUC-JP 渦???ц?蘊??v渦???ц?蘊??vB 101100011011001000111111001111110011111110100111111010000011111111101001101111100011111100111111011101101011000110110010001111110011111100111111101001111110100000111111111010011011111000111111001111110111011001000010 b1b23f3f3fa7e83fe9be3f3f76b1b23f3f3fa7e83fe9be3f3f7642
UTF-8 渦곈콌歷ц꽦蘊듾퓱v渦곈콌歷ц꽦蘊듾퓱vB 11100110101110001010011011101010101100111000100011101100101111011000110011101111101001101000110011010001100001101110101010111101101001101110100010011000100010101110101110010011101111101110110110010011101100010111011011100110101110001010011011101010101100111000100011101100101111011000110011101111101001101000110011010001100001101110101010111101101001101110100010011000100010101110101110010011101111101110110110010011101100010111011001000010 e6b8a6eab388ecbd8cefa68cd186eabda6e8988aeb93beed93b176e6b8a6eab388ecbd8cefa68cd186eabda6e8988aeb93beed93b17642
UHC 渦곈콌歷ц꽦蘊듾퓱v渦곈콌歷ц꽦蘊듾퓱vB 111010001011111010110000111010011011000110001000111001101011100010101100111010001000010010110001111010001011001110001010111001001011111110010111011101101110100010111110101100001110100110110001100010001110011010111000101011001110100010000100101100011110100010110011100010101110010010111111100101110111011001000010 e8beb0e9b188e6b8ace884b1e8b38ae4bf9776e8beb0e9b188e6b8ace884b1e8b38ae4bf977642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)