To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 汚?????獰??節??言??曜????? 1000100110011000001111110011111100111111001111110011111111100000110101100011111100111111100100001101111100111111001111111000110010111110001111110011111110010111011010100011111100111111001111110011111100111111 89983f3f3f3f3fe0d63f3f90df3f3f8cbe3f3f976a3f3f3f3f3f
EUC-JP 汚?????獰??節??言??曜????? 1011000111111000001111110011111100111111001111110011111111100000110110000011111100111111110000001110000100111111001111111011100011000000001111110011111111001101110010110011111100111111001111110011111100111111 b1f83f3f3f3f3fe0d83f3fc0e13f3fb8c03f3fcdcb3f3f3f3f3f
UTF-8 汚억슬料꿰큾獰앶슲節룬닊言븅뀴曜뱄슘料꿜튋 111001101011000110011010111011001001011010110101111011001000101010101100111011111010011010111110111010101011111110110000111011011000000110111110111001111000110110110000111011001001010110110110111011001000101010110010111001111010111110000000111010111010001110101100111010111000101110001010111010001010100010000000111010111011100010000101111010111000000010110100111001101001101110011100111010111011000110000100111011001000101010011000111011111010011010111110111010101011111110011100111011011000101010001011 e6b19aec96b5ec8aacefa6beeabfb0ed81bee78db0ec95b6ec8ab2e7af80eba3aceb8b8ae8a880ebb885eb80b4e69b9cebb184ec8a98efa6beeabf9ced8a8b
UHC 汚억슬料꿰큾獰앶슲節룬닊言븅뀴曜뱄슘料꿜튋 111001111111110110111110111011111011110110111101111010001111011110110010111001111011010010001011111001111011111010011101111010011001101010111001111011111011110110110111111010011000100010010001111001011110101110111010111010011000010110101010111010001111100010111001111011111011110110110111111010001111011110110010111001001011100110011111 e7fdbeefbdbde8f7b2e7b48be7be9de99ab9efbdb7e98891e5ebbae985aae8f8b9efbdb7e8f7b2e4b99f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)