What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??鍮???る?獄??唯g?惟???κ? 1000100101010001001111110011111111101000010010100011111100111111001111111000001011101001001111111000110110010110001111110011111110010111010000101000001010000111001111111000100011010010001111110011111100111111100000111100100000111111 89513f3fe84a3f3f3f82e93f8d963f3f974282873f88d23f3f3f83c83f
EUC-JP 渦??鍮???る?獄??唯g?惟???κ? 1011000110110010001111110011111111101111101010110011111100111111001111111010010011101011001111111011100111110110001111110011111111001101101000111010001111100111001111111011000011010100001111110011111100111111101001101100101000111111 b1b23f3fefab3f3f3fa4eb3fb9f63f3fcda3a3e73fb0d43f3f3fa6ca3f
UTF-8 渦기뫁鍮뽭씣戮る연獄쎼룗唯g뙠惟곗뵞若κ퓖 1110011010111000101001101110101010111000101100001110101110101011100000011110100110001101101011101110101110111101101011011110110010010100101000111110111110100111100100101110001110000010100010111110110010010111101100001110011110001101100001001110110010001110101111001110101110100011100101111110010110010100101011111110111110111101100001111110101110011001101000001110011010000011100111111110101010110011100101111110101110110101100111101110111110100101101101001100111010111010111011011001001110010110 e6b8a6eab8b0ebab81e98daeebbdadec94a3efa792e3828bec97b0e78d84ec8ebceba397e594afefbd87eb99a0e6839feab397ebb59eefa5b4cebaed9396
UHC 渦기뫁鍮뽭씣戮る연獄쎼룗唯g뙠惟곗뵞若κ퓖 111010001011111010110001111000101001000110100101111010111011100110010110111010011001110110110111111010111011110110101010111010111011111110101100111010001010101110011011111000111000111110010011111010101110011010100011111001111000110010100101111010101110111010110000111011001001010010011110111001011010111010100101111010101011111110000001 e8beb1e291a5ebb996e99db7ebbdaaebbface8ab9be38f93eae6a3e78ca5eaeeb0ec949ee5aea5eabf81

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
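
The same kind of conversion can be sketched offline in Python. This is only a minimal illustration, not the code behind this page: the function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed to be close equivalents of the charsets listed above, so individual mappings may differ slightly from this page's output. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is presumably what the 3f bytes in the table represent.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each output line follows the column order of the table: charset, character, bit string (binary), bit string (hexadecimal).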