To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??兀?ⅹ弱??沃???ц?弱??沃?? 1000100101010001001111110011111110011001010110010011111111111010010010011000111011100011001111110011111110010111100000000011111100111111001111111000010010001000001111111000111011100011001111110011111110010111100000000011111100111111 89513f3f99593ffa498ee33f3f97803f3f3f84883f8ee33f3f97803f3f
EUC-JP 渦??兀??弱??沃???ц?弱??沃?? 10110001101100100011111100111111110100011011101000111111001111111011110011100101001111110011111111001101111000000011111100111111001111111010011111101000001111111011110011100101001111110011111111001101111000000011111100111111 b1b23f3fd1ba3f3fbce53f3fcde03f3f3fa7e83fbce53f3fcde03f3f
UTF-8 渦욘릍兀덂ⅹ弱딂떨沃곈걶歷ц꽦弱딂뵷沃곈걶 1110011010111000101001101110110010011010100110001110101110100110100011011110010110000101100000001110101110001101100000101110001010000101101110011110010110111100101100011110101110010100100000101110101110010110101010001110011010110010100000111110101010110011100010001110101010110001101101101110111110100110100011001101000110000110111010101011110110100110111001011011110010110001111010111001010010000010111010111011010110110111111001101011001010000011111010101011001110001000111010101011000110110110 e6b8a6ec9a98eba68de58580eb8d82e285b9e5bcb1eb9482eb96a8e6b283eab388eab1b6efa68cd186eabda6e5bcb1eb9482ebb5b7e6b283eab388eab1b6
UHC 渦욘릍兀덂ⅹ弱딂떨沃곈걶歷ц꽦弱딂뵷沃곈걶 111010001011111010111111111001101011100010101100111010001011010010001000111001011010010110101010111001011011000010001010111010001011011010110011111010001010101010110000111010011000000110011100111001101011100010101100111010001000010010110001111001011011000010001010111010001001010010110101111010001010101010110000111010011000000110011100 e8bebfe6b8ace8b488e5a5aae5b08ae8b6b3e8aab0e9819ce6b8ace884b1e5b08ae894b5e8aab0e9819c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)