To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 辰孫存谷臓続狸即卒奪尊存竪綻続狸即卒B 10010010010000111001000110110111100100011011011010010010010010101001000110011111100100011011000110010010010010111001000110100110100100011011001010010010010001001001000110111000100100011011011010010010010001111001001001011101100100011011000110010010010010111001000110100110100100011011001001000010 924391b791b6924a919f91b1924b91a691b2924491b891b69247925d91b1924b91a691b242
EUC-JP 辰孫存谷臓続狸即卒奪尊存竪綻続狸即卒B 11000011101001001100001010111001110000101011100011000011101010111100001010100001110000101011001111000011101011001100001010101000110000101011010011000011101001011100001010111010110000101011100011000011101010001100001110111110110000101011001111000011101011001100001010101000110000101011010001000010 c3a4c2b9c2b8c3abc2a1c2b3c3acc2a8c2b4c3a5c2bac2b8c3a8c3bec2b3c3acc2a8c2b442
UTF-8 辰孫存谷臓続狸即卒奪尊存竪綻続狸即卒B 11101000101111101011000011100101101011011010101111100101101011011001100011101000101100001011011111101000100001111001001111100111101101101001101011100111100010111011100011100101100011011011001111100101100011011001001011100101101001011010101011100101101100001000101011100101101011011001100011100111101010111010101011100111101101101011101111100111101101101001101011100111100010111011100011100101100011011011001111100101100011011001001001000010 e8beb0e5adabe5ad98e8b0b7e88793e7b69ae78bb8e58db3e58d92e5a5aae5b08ae5ad98e7abaae7b6bbe7b69ae78bb8e58db3e58d9242
UHC 辰孫存谷??狸?卒奪尊存竪綻?狸?卒B 1111001011100011111000011101110111110000111011011100110111011011001111110011111111010111111000010011111111110000111011111111011110101100111100001110111011110000111011011110001010110101111101111010101000111111110101111110000100111111111100001110111101000010 f2e3e1ddf0edcddb3f3fd7e13ff0eff7acf0eef0ede2b5f7aa3fd7e13ff0ef42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)