To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 要??????⑥?筌??議??音⑤?娃 1001011101110110001111110011111100111111001111110011111100111111100001110100010100111111111000101010001100111111001111111000101101100011001111110011111110001001101110011000011101000100001111111000100010100001 97763f3f3f3f3f3f87453fe2a33f3f8b633f3f89b987443f88a1
EUC-JP 要??沅??洹??筌??議??音??娃 11001101110101110011111100111111100011111100011011101001001111110011111110001111110001111011101000111111001111111110010010100101001111110011111110110101110001000011111100111111101100101011101100111111001111111011000010100011 cdd73f3f8fc6e93f3f8fc7ba3f3fe4a53f3fb5c43f3fb2bb3f3fb0a3
UTF-8 要쏅뀈沅쀯쭪洹⑥춷筌먲퐣議녶젾音⑤큸娃 111010001010011010000001111011001000111110000101111010111000000010001000111001101011001010000101111011001000000010101111111011001010110110101010111001101011010010111001111000101001000110100101111011001011011010110111111001111010110110001100111010111010100010110010111011011001000010100011111010001010110110110000111010111000010110110110111011001010000010111110111010011001111110110011111000101001000110100100111011011000000110111000111001011010100010000011 e8a681ec8f85eb8088e6b285ec80afecadaae6b4b9e291a5ecb6b7e7ad8ceba8b2ed90a3e8adb0eb85b6eca0bee99fb3e291a4ed81b8e5a883
UHC 要쏅뀈沅쀯쭪洹⑥춷筌먲퐣議녶젾音⑤큸娃 1110100110101001100110111110101110000101100001001110101010110110100101111110111110100111100111101110101010110111101010001110110010101101100100111110111110100111100100001110111110111101100011001110110010100001100001101110010110100000101100001110101111100101101010001110101110110100100001111110100011011111 e9a99beb8584eab697efa79eeab7a8ecad93efa790efbd8ceca186e5a0b0ebe5a8ebb487e8df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)