To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 嚥??二??惟?8}嚥??二??惟?8{^ 1001101010001011001111110011111110010011111100010011111100111111100010001101001000111111100000100101011101111101100110101000101100111111001111111001001111110001001111110011111110001000110100100011111110000010010101110111101101011110 9a8b3f3f93f13f3f88d23f82577d9a8b3f3f93f13f3f88d23f82577b5e
EUC-JP 嚥??二??惟?8}嚥??二??惟?8{^ 1101001111101011001111110011111111000110111100110011111100111111101100001101010000111111101000111011100001111101110100111110101100111111001111111100011011110011001111110011111110110000110101000011111110100011101110000111101101011110 d3eb3f3fc6f33f3fb0d43fa3b87dd3eb3f3fc6f33f3fb0d43fa3b87b5e
UTF-8 嚥↔퍓二삥에惟곗8}嚥↔퍓二삥에惟곗8{^ 111001011001101010100101111000101000011010010100111011011000110110010011111001001011101010001100111011001000001010100101111011001001011110010000111001101000001110011111111010101011001110010111111011111011110010011000011111011110010110011010101001011110001010000110100101001110110110001101100100111110010010111010100011001110110010000010101001011110110010010111100100001110011010000011100111111110101010110011100101111110111110111100100110000111101101011110 e59aa5e28694ed8d93e4ba8cec82a5ec9790e6839feab397efbc987de59aa5e28694ed8d93e4ba8cec82a5ec9790e6839feab397efbc987b5e
UHC 嚥↔퍓二삥에惟곗8}嚥↔퍓二삥에惟곗8{^ 111001101011111110100001111010101011101110001010111011001010001110111011111001101011111110100001111010101110111010110000111011001010001110111000011111011110011010111111101000011110101010111011100010101110110010100011101110111110011010111111101000011110101011101110101100001110110010100011101110000111101101011110 e6bfa1eabb8aeca3bbe6bfa1eaeeb0eca3b87de6bfa1eabb8aeca3bbe6bfa1eaeeb0eca3b87b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)