To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ???橈??仰??v???橈??仰??vB 00111111001111110011111110011110111101000011111100111111100010111100001000111111001111110111011000111111001111110011111110011110111101000011111100111111100010111100001000111111001111110111011001000010 3f3f3f9ef43f3f8bc23f3f763f3f3f9ef43f3f8bc23f3f7642
EUC-JP ???橈??仰??v???橈??仰??vB 00111111001111110011111111011100111101100011111100111111101101101100010000111111001111110111011000111111001111110011111111011100111101100011111100111111101101101100010000111111001111110111011001000010 3f3f3fdcf63f3fb6c43f3f763f3f3fdcf63f3fb6c43f3f7642
UTF-8 凉붾졁橈쎈젪仰뜻땼v凉붾졁橈쎈젪仰뜻땼vB 111011111010010110111001111010111011011010111110111011001010000110000001111001101010100110001000111011001000111010001000111011001010000010101010111001001011101110110000111010111001110010111011111010111001010110111100011101101110111110100101101110011110101110110110101111101110110010100001100000011110011010101001100010001110110010001110100010001110110010100000101010101110010010111011101100001110101110011100101110111110101110010101101111000111011001000010 efa5b9ebb6beeca181e6a988ec8e88eca0aae4bbb0eb9cbbeb95bc76efa5b9ebb6beeca181e6a988ec8e88eca0aae4bbb0eb9cbbeb95bc7642
UHC 凉붾졁橈쎈젪仰뜻땼v凉붾졁橈쎈젪仰뜻땼vB 111001011011110010010100111010111010000010110010111010001111101010111101111010111010000010100010111001001110011010110110111001101000101110010010011101101110010110111100100101001110101110100000101100101110100011111010101111011110101110100000101000101110010011100110101101101110011010001011100100100111011001000010 e5bc94eba0b2e8fabdeba0a2e4e6b6e68b9276e5bc94eba0b2e8fabdeba0a2e4e6b6e68b927642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)