To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?щ?伍??彦ц??ц????荏??伍??^ 0011111110000100100010110011111110001100110111100011111100111111100101010100011010000100100010000011111100111111100001001000100000111111001111110011111100111111100010010110000000111111001111111000110011011110001111110011111101011110 3f848b3f8cde3f3f954684883f3f84883f3f3f3f89603f3f8cde3f3f5e
EUC-JP ?щ?伍??彦ц?馹ц????荏??伍??^ 00111111101001111110101100111111101110001110000000111111001111111100100110100111101001111110100000111111100011111110100110100001101001111110100000111111001111110011111100111111101100011100000100111111001111111011100011100000001111110011111101011110 3fa7eb3fb8e03f3fc9a7a7e83f8fe9a1a7e83f3f3f3fb1c13f3fb8e03f3f5e
UTF-8 吳щ젿伍곹쓼彦ц똻馹ц뫊溜김똻荏쀦떖伍곹벉^ 11100101100100001011001111010001100010011110110010100000101111111110010010111100100011011110101010110011101110011110110010010011101111001110010110111101101001101101000110000110111010111001100010111011111010011010011010111001110100011000011011101011101010111000101011101111101001111000101111101010101110011000000011101011100110001011101111101000100011011000111111101100100000001010011011101011100101101001011011100100101111001000110111101010101100111011100111101011101100101000100101011110 e590b3d189eca0bfe4bc8deab3b9ec93bce5bda6d186eb98bbe9a6b9d186ebab8aefa78beab980eb98bbe88d8fec80a6eb9696e4bc8deab3b9ebb2895e
UHC 吳щ젿伍곹쓼彦ц똻馹ц뫊溜김똻荏쀦떖伍곹벉^ 11100111111011111010110011101011101000001011000111100111111010101000000111101101100111011001011111100101111010011010110011101000100011001000000111101100111100011010110011101000100100011010110011101010111111101011000111101000100011001000000111101100111110111001011111100110100010111010110011100111111010101000000111101101100100111010110001011110 e7efaceba0b1e7ea81ed9d97e5e9ace88c81ecf1ace891aceafeb1e88c81ecfb97e68bace7ea81ed93ac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)