To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???幼?壤??鎰??韋?????唯??議?? 0011111100111111001111111001011101100011001111111001101011011111001111110011111111101000010011000011111100111111111010001110100000111111001111110011111100111111001111111001011101000010001111110011111110001011011000110011111100111111 3f3f3f97633f9adf3f3fe84c3f3fe8e83f3f3f3f3f97423f3f8b633f3f
EUC-JP ???幼?壤??鎰??韋?????唯??議?? 0011111100111111001111111100110111000100001111111101010011100001001111110011111111101111101011010011111100111111111100001110101000111111001111110011111100111111001111111100110110100011001111110011111110110101110001000011111100111111 3f3f3fcdc43fd4e13f3fefad3f3ff0ea3f3f3f3f3fcda33f3fb5c43f3f
UTF-8 捻뚭염幼쉇壤깆쥜鎰쏁독韋블닂捻뚭엽唯롥쉽議얠쑠 111011111010011010100100111010111001101010101101111011001001011110111100111001011011100110111100111011001000100110000111111001011010001110100100111010101011100110000110111011001010010110011100111010011000111010110000111011001000111110000001111010111000111110000101111010011001111110001011111010111011100010010100111010111000101110000010111011111010011010100100111010111001101010101101111011001001011110111101111001011001010010101111111010111010000110100101111011001000100110111101111010001010110110110000111011001001011010100000111011001001000110100000 efa6a4eb9aadec97bce5b9bcec8987e5a3a4eab986eca59ce98eb0ec8f81eb8f85e99f8bebb894eb8b82efa6a4eb9aadec97bde594afeba1a5ec89bde8adb0ec96a0ec91a0
UHC 捻뚭염幼쉇壤깆쥜鎰쏁독韋블닂捻뚭엽唯롥쉽議얠쑠 11100110111101111000110011101010101111111011000011101010111010101001101001100010111001011011110110110001111011001010001010010001111011001111000010011011111001111011010110110110111010101101111110111010111011011000100010001011111001101111011110001100111010101011111110110001111010101110011010001110111001011011110110110001111011001010000110111110111011001001110010111111 e6f78ceabfb0eaea9a62e5bdb1eca291ecf09be7b5b6eadfbaed888be6f78ceabfb1eae68ee5bdb1eca1beec9cbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)