To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????h??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011010000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????蘂?h?????????蘂? 00111111001111110011111100111111001111110011111100111111001111110011111111100101010000010011111101101000001111110011111100111111001111110011111100111111001111110011111100111111111001010100000100111111 3f3f3f3f3f3f3f3f3fe5413f683f3f3f3f3f3f3f3f3fe5413f
EUC-JP ???瑗?????蘂?h???瑗?????蘂? 0011111100111111001111111000111111001100110000000011111100111111001111110011111100111111111010011010001000111111011010000011111100111111001111111000111111001100110000000011111100111111001111110011111100111111111010011010001000111111 3f3f3f8fccc03f3f3f3f3fe9a23f683f3f3f8fccc03f3f3f3f3fe9a23f
UTF-8 溜삠뀛瑗썬뀛麗묆뀛蘂긌h溜삠뀛瑗썬뀛麗묆뀛蘂긌 11101111101001111000101111101100100000101010000011101011100000001001101111100111100100011001011111101100100011011010110011101011100000001001101111101111101001101000100011101011101011001000011011101011100000001001101111101000100110001000001011101010101110001000110001101000111011111010011110001011111011001000001010100000111010111000000010011011111001111001000110010111111011001000110110101100111010111000000010011011111011111010011010001000111010111010110010000110111010111000000010011011111010001001100010000010111010101011100010001100 efa78bec82a0eb809be79197ec8daceb809befa688ebac86eb809be89882eab88c68efa78bec82a0eb809be79197ec8daceb809befa688ebac86eb809be89882eab88c
UHC 溜삠뀛瑗썬뀛麗묆뀛蘂긌h溜삠뀛瑗썬뀛麗묆뀛蘂긌 111010101111111010111011111000111000010110010100111010101011110010111101111000111000010110010100111001101011000010010001111000111000010110010100111001111101111010000011010011000110100011101010111111101011101111100011100001011001010011101010101111001011110111100011100001011001010011100110101100001001000111100011100001011001010011100111110111101000001101001100 eafebbe38594eabcbde38594e6b091e38594e7de834c68eafebbe38594eabcbde38594e6b091e38594e7de834c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)