To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??兀?ⅹ弱??沃???ц?弱??沃??言 10001001010100010011111100111111100110010101100100111111111110100100100110001110111000110011111100111111100101111000000000111111001111110011111110000100100010000011111110001110111000110011111100111111100101111000000000111111001111111000110010111110 89513f3f99593ffa498ee33f3f97803f3f3f84883f8ee33f3f97803f3f8cbe
EUC-JP 渦??兀??弱??沃???ц?弱??沃??言 101100011011001000111111001111111101000110111010001111110011111110111100111001010011111100111111110011011110000000111111001111110011111110100111111010000011111110111100111001010011111100111111110011011110000000111111001111111011100011000000 b1b23f3fd1ba3f3fbce53f3fcde03f3f3fa7e83fbce53f3fcde03f3fb8c0
UTF-8 渦욘릍兀덂ⅹ弱딂떨沃곈걶歷ц꽦弱딂뵷沃곈걶言 1110011010111000101001101110110010011010100110001110101110100110100011011110010110000101100000001110101110001101100000101110001010000101101110011110010110111100101100011110101110010100100000101110101110010110101010001110011010110010100000111110101010110011100010001110101010110001101101101110111110100110100011001101000110000110111010101011110110100110111001011011110010110001111010111001010010000010111010111011010110110111111001101011001010000011111010101011001110001000111010101011000110110110111010001010100010000000 e6b8a6ec9a98eba68de58580eb8d82e285b9e5bcb1eb9482eb96a8e6b283eab388eab1b6efa68cd186eabda6e5bcb1eb9482ebb5b7e6b283eab388eab1b6e8a880
UHC 渦욘릍兀덂ⅹ弱딂떨沃곈걶歷ц꽦弱딂뵷沃곈걶言 1110100010111110101111111110011010111000101011001110100010110100100010001110010110100101101010101110010110110000100010101110100010110110101100111110100010101010101100001110100110000001100111001110011010111000101011001110100010000100101100011110010110110000100010101110100010010100101101011110100010101010101100001110100110000001100111001110010111101011 e8bebfe6b8ace8b488e5a5aae5b08ae8b6b3e8aab0e9819ce6b8ace884b1e5b08ae894b5e8aab0e9819ce5eb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)