To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???唯??轅⑥????幼??醫??永??幼? 001111110011111100111111100101110100001000111111001111111110011101110110100001110100010100111111001111110011111100111111100101110110001100111111001111111110011111001110001111110011111110001001011010010011111100111111100101110110001100111111 3f3f3f97423f3fe77687453f3f3f3f97633f3fe7ce3f3f89693f3f97633f
EUC-JP ???唯??轅?????幼??醫??永??幼? 0011111100111111001111111100110110100011001111110011111111101101110101110011111100111111001111110011111100111111110011011100010000111111001111111110111011010000001111110011111110110001110010100011111100111111110011011100010000111111 3f3f3fcda33f3fedd73f3f3f3f3fcdc43f3feed03f3fb1ca3f3fcdc43f
UTF-8 捻뚭엽唯롧솈轅⑥끽捻뚭염幼싧쉽醫묓뮁永띕겳幼쉋 111011111010011010100100111010111001101010101101111011001001011110111101111001011001010010101111111010111010000110100111111011001000011010001000111010001011110110000101111000101001000110100101111010111000000110111101111011111010011010100100111010111001101010101101111011001001011110111100111001011011100110111100111011001000101110100111111011001000100110111101111010011000011010101011111010111010110010010011111010111010111010000001111001101011000010111000111010111001110110010101111010101011001010110011111001011011100110111100111011001000100110001011 efa6a4eb9aadec97bde594afeba1a7ec8688e8bd85e291a5eb81bdefa6a4eb9aadec97bce5b9bcec8ba7ec89bde986abebac93ebae81e6b0b8eb9d95eab2b3e5b9bcec898b
UHC 捻뚭엽唯롧솈轅⑥끽捻뚭염幼싧쉽醫묓뮁永띕겳幼쉋 11100110111101111000110011101010101111111011000111101010111001101000111011100111100110011000110011101010101111111010100011101100101100111010001111100110111101111000110011101010101111111011000011101010111010101001101011100101101111011011000111101100101000101001000111101101100100101001000011100111101101011011011011101011100000011011111111101010111010101001101001100101 e6f78ceabfb1eae68ee7998ceabfa8ecb3a3e6f78ceabfb0eaea9ae5bdb1eca291ed9290e7b5b6eb81bfeaea9a65

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)