To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣▽?蹂??艶c?猷η?膺??裔 00111111001111110011111110001011100000111000000110100100001111111110011011111000001111110011111110001001100100001000001010000011001111111001011101010001100000111100010100111111111001000101111000111111001111111110010111100001 3f3f3f8b8381a43fe6f83f3f899082833f975183c53fe45e3f3fe5e1
EUC-JP ???泣▽?蹂??艶c?猷η?膺??裔 00111111001111110011111110110101111000111010001010100110001111111110110011111010001111110011111110110001111100001010001111100011001111111100110110110010101001101100011100111111111001111011111100111111001111111110101011100011 3f3f3fb5e3a2a63fecfa3f3fb1f0a3e33fcdb2a6c73fe7bf3f3feae3
UTF-8 捻꿔끇泣▽슭蹂잙큶艶c끇猷η뛾膺쇰짃裔 1110111110100110101001001110101010111111100101001110101110000001100001111110011010110011101000111110001010010110101111011110110010001010101011011110100010111001100000101110110010011110100110011110110110000001101101101110100010001001101101101110111110111101100000111110101110000001100001111110011110001100101101111100111010110111111010111001101110111110111010001000011010111010111011001000011110110000111011001010011110000011111010001010001110010100 efa6a4eabf94eb8187e6b3a3e296bdec8aade8b982ec9e99ed81b6e889b6efbd83eb8187e78cb7ceb7eb9bbee886baec87b0eca783e8a394
UHC 捻꿔끇泣▽슭蹂잙큶艶c끇猷η뛾膺쇰짃裔 1110011011110111101100101110001110000101101110111110101111101000101000011110010010111101101111101110101110110011100111111110101110110100100001011110011011111101101000111110001110000101101110111110101110100011101001011110011110001101100001001110101111101100101111001110101110100011100100111110011111100000 e6f7b2e385bbebe8a1e4bdbeebb39febb485e6fda3e385bbeba3a5e78d84ebecbceba393e7e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)