To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 娃??梧??釗?????節よ?輿??汚??^ 1000100010100001001111110011111110001100111001100011111100111111111110111011101100111111001111110011111100111111001111111001000011011111100000101110011000111111100101110110000000111111001111111000100110011000001111110011111101011110 88a13f3f8ce63f3ffbbb3f3f3f3f3f90df82e63f97603f3f89983f3f5e
EUC-JP 娃??梧??釗?????節よ?輿??汚??^ 101100001010001100111111001111111011100011101000001111110011111110001111111000111010011000111111001111110011111100111111001111111100000011100001101001001110100000111111110011011100000100111111001111111011000111111000001111110011111101011110 b0a33f3fb8e83f3f8fe3a63f3f3f3f3fc0e1a4e83fcdc13f3fb1f83f3f5e
UTF-8 娃띰쉠梧삥쳛釗녘웺寧좈겤節よ왊輿곣뵵汚녽죳^ 11100101101010001000001111101011100111011011000011101100100010011010000011100110101000101010011111101100100000101010010111101100101100111001101111101001100001111001011111101011100001011001100011101100100110111011101011101111101001101010101011101100101000101000100011101010101100101010010011100111101011111000000011100011100000101000100011101100100110011000101011101000101111001011111111101010101100111010001111101011101101011011010111100110101100011001101011101011100001011011110111101100101000111011001101011110 e5a883eb9db0ec89a0e6a2a7ec82a5ecb39be98797eb8598ec9bbaefa6aaeca288eab2a4e7af80e38288ec998ae8bcbfeab3a3ebb5b5e6b19aeb85bdeca3b35e
UHC 娃띰쉠梧삥쳛釗녘웺寧좈겤節よ왊輿곣뵵汚녽죳^ 11101000110111111011011011101111101111011010101011100111111111001011101111100110101010111000000111100001111100101011001111101000100111111000011011100111101011001010000011101001100000011011011011101111101111011010101011101000100111101011101111100110101010111000000111100010100101001011001111100111111111011000011011101001101000011000111001011110 e8dfb6efbdaae7fcbbe6ab81e1f2b3e89f86e7aca0e981b6efbdaae89ebbe6ab81e294b3e7fd86e9a18e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)