To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 嚥?????擬??[嚥?????擬??[^ 10011010100010110011111100111111001111110011111100111111100010110101101100111111001111110101101110011010100010110011111100111111001111110011111100111111100010110101101100111111001111110101101101011110 9a8b3f3f3f3f3f8b5b3f3f5b9a8b3f3f3f3f3f8b5b3f3f5b5e
EUC-JP 嚥??璵??擬??[嚥??璵??擬??[^ 1101001111101011001111110011111110001111110011001110011000111111001111111011010110111100001111110011111101011011110100111110101100111111001111111000111111001100111001100011111100111111101101011011110000111111001111110101101101011110 d3eb3f3f8fcce63f3fb5bc3f3f5bd3eb3f3f8fcce63f3fb5bc3f3f5b5e
UTF-8 嚥좎뼵璵쀨펹擬멩쪞[嚥좎뼵璵쀨펹擬멩쪞[^ 111001011001101010100101111011001010001010001110111010111011110010110101111001111001001010110101111011001000000010101000111011011000111010111001111001101001001110101100111010111010100110101001111011001010101010011110010110111110010110011010101001011110110010100010100011101110101110111100101101011110011110010010101101011110110010000000101010001110110110001110101110011110011010010011101011001110101110101001101010011110110010101010100111100101101101011110 e59aa5eca28eebbcb5e792b5ec80a8ed8eb9e693aceba9a9ecaa9e5be59aa5eca28eebbcb5e792b5ec80a8ed8eb9e693aceba9a9ecaa9e5b5e
UHC 嚥좎뼵璵쀨펹擬멩쪞[嚥좎뼵璵쀨펹擬멩쪞[^ 111001101011111110100000111011001001011010111000111001101010010110010111111010001011110010001001111010111111010010111000111001101010010110010111010110111110011010111111101000001110110010010110101110001110011010100101100101111110100010111100100010011110101111110100101110001110011010100101100101110101101101011110 e6bfa0ec96b8e6a597e8bc89ebf4b8e6a5975be6bfa0ec96b8e6a597e8bc89ebf4b8e6a5975b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)