To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷?????嗽?5節?5??5旬??徇?+ 100101110101000100111111001111110011111100111111001111111001101001110101001111111000001001010100100100001101111100111111100000100101010000111111001111111000001001010100100011110111101100111111001111111001110001101101001111111000000101111011 97513f3f3f3f3f9a753f825490df3f82543f3f82548f7b3f3f9c6d3f817b
EUC-JP 猷?????嗽?5節?5蓀?5旬??徇?+ 1100110110110010001111110011111100111111001111110011111111010011110101100011111110100011101101011100000011100001001111111010001110110101100011111101100011111000001111111010001110110101101111011101110000111111001111111101011111001110001111111010000111011100 cdb23f3f3f3f3fd3d63fa3b5c0e13fa3b58fd8f83fa3b5bddc3f3fd7ce3fa1dc
UTF-8 猷듐걖呂룔걚嗽뉖5節앸5蓀껊5旬꿩썭徇꾨+ 111001111000110010110111111010111001001110010000111010101011000110010110111011111010011010000000111010111010001110010100111010101011000110011010111001011001011110111101111010111000100110010110111011111011110010010101111001111010111110000000111011001001010110111000111011111011110010010101111010001001001110000000111010101011101110001010111011111011110010010101111001101001011110101100111010101011111110101001111011001000110110101101111001011011111010000111111010101011111010101000111011111011110010001011 e78cb7eb9390eab196efa680eba394eab19ae597bdeb8996efbc95e7af80ec95b8efbc95e89380eabb8aefbc95e697aceabfa9ec8dade5be87eabea8efbc8b
UHC 猷듐걖呂룔걚嗽뉖5節앸5蓀껊5旬꿩썭徇꾨+ 111010111010001110110101111000111000000110000001111001011111101110110111111000111000000110000100111000011111010110000111111010111010001110110101111011111011110110011101111010111010001110110101111000011110000010000011111010111010001110110101111000101110001010110010111001101001101110011101111000101101111110000100111010111010001110101011 eba3b5e38181e5fbb7e38184e1f587eba3b5efbd9deba3b5e1e083eba3b5e2e2b2e69b9de2df84eba3ab

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-win = CP932) and UNIX (EUC-JP) respectively.
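
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-win, `cp949` for UHC, `euc_jp` for EUC-JP) is an assumption. Python's codecs may differ from this page's converter for some vendor-specific characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # U+FF15 (fullwidth digit five) appears in the table's input.
    for label, (bits, hexstr) in convert("\uff15").items():
        print(label, bits, hexstr)
```

For the fullwidth digit five, this sketch yields `efbc95` in UTF-8, `a3b5` in EUC-JP, and `8254` in SJIS-win, the same byte sequences that appear for that character in the table rows above.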