To which bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ???揖??循??n}???揖??循??n{^ 001111110011111100111111100101110100101100111111001111111000111101111010001111110011111101101110011111010011111100111111001111111001011101001011001111110011111110001111011110100011111100111111011011100111101101011110 3f3f3f974b3f3f8f7a3f3f6e7d3f3f3f974b3f3f8f7a3f3f6e7b5e
EUC-JP ???揖??循??n}???揖??循??n{^ 001111110011111100111111110011011010110000111111001111111011110111011011001111110011111101101110011111010011111100111111001111111100110110101100001111110011111110111101110110110011111100111111011011100111101101011110 3f3f3fcdac3f3fbddb3f3f6e7d3f3f3fcdac3f3fbddb3f3f6e7b5e
UTF-8 劣꾧퉵揖좂벀循륁뎵n}劣꾧퉵揖좂벀循륁뎵n{^ 1110111110100110100111011110101010111110101001111110110110001001101101011110011010001111100101101110110010100010100000101110101110110010100000001110010110111110101010101110101110100101100000011110101110001110101101010110111001111101111011111010011010011101111010101011111010100111111011011000100110110101111001101000111110010110111011001010001010000010111010111011001010000000111001011011111010101010111010111010010110000001111010111000111010110101011011100111101101011110 efa69deabea7ed89b5e68f96eca282ebb280e5beaaeba581eb8eb56e7defa69deabea7ed89b5e68f96eca282ebb280e5beaaeba581eb8eb56e7b5e
UHC 劣꾧퉵揖좂벀循륁뎵n}劣꾧퉵揖좂벀循륁뎵n{^ 1110011011101011100001001110101010111001100011011110101111100111101000001110011110010011101001101110001011100000100011111110110010001001100010000110111001111101111001101110101110000100111010101011100110001101111010111110011110100000111001111001001110100110111000101110000010001111111011001000100110001000011011100111101101011110 e6eb84eab98debe7a0e793a6e2e08fec89886e7de6eb84eab98debe7a0e793a6e2e08fec89886e7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
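
The conversion shown in the table can be reproduced roughly as follows. This is only a sketch in Python, not the code behind this page: the function name `encode_to_bits` and the mapping from the table's charset labels to Python codec names are assumptions, and so is the use of `errors="replace"`, chosen because unencodable characters appear as `?` (byte `3f`) in the table.

```python
# Hypothetical mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Characters the codec cannot represent become "?" (0x3f).
    data = text.encode(codec, errors="replace")
    # Each byte is written as exactly 8 binary digits, most significant first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    sample = "揖n}"
    for label, codec in CHARSETS.items():
        bits, hexed = encode_to_bits(sample, codec)
        print(label, bits, hexed)
```

Each byte contributes eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.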