To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 蟻μ?宥?????[蟻μ?宥?????[^ 100010110110000110000011110010100011111110010111010001110011111100111111001111110011111100111111010110111000101101100001100000111100101000111111100101110100011100111111001111110011111100111111001111110101101101011110 8b6183ca3f97473f3f3f3f3f5b8b6183ca3f97473f3f3f3f3f5b5e
EUC-JP 蟻μ?宥?????[蟻μ?宥?????[^ 101101011100001010100110110011000011111111001101101010000011111100111111001111110011111100111111010110111011010111000010101001101100110000111111110011011010100000111111001111110011111100111111001111110101101101011110 b5c2a6cc3fcda83f3f3f3f3f5bb5c2a6cc3fcda83f3f3f3f3f5b5e
UTF-8 蟻μ콈宥살옣梨꾨떩[蟻μ콈宥살옣梨꾨떩[^ 11101000100111111011101111001110101111001110110010111101100010001110010110101110101001011110110010000010101101001110110010011000101000111110111110100111101000101110101010111110101010001110101110010110101010010101101111101000100111111011101111001110101111001110110010111101100010001110010110101110101001011110110010000010101101001110110010011000101000111110111110100111101000101110101010111110101010001110101110010110101010010101101101011110 e89fbbcebcecbd88e5aea5ec82b4ec98a3efa7a2eabea8eb96a95be89fbbcebcecbd88e5aea5ec82b4ec98a3efa7a2eabea8eb96a95b5e
UHC 蟻μ콈宥살옣梨꾨떩[蟻μ콈宥살옣梨꾨떩[^ 111010111111110010100101111011001011000110000100111010101110100110111011111011001001111010100101111011001011000110000100111010111000101110111011010110111110101111111100101001011110110010110001100001001110101011101001101110111110110010011110101001011110110010110001100001001110101110001011101110110101101101011110 ebfca5ecb184eae9bbec9ea5ecb184eb8bbb5bebfca5ecb184eae9bbec9ea5ecb184eb8bbb5b5e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
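
The conversion shown in the table above can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for text.

    Hypothetical helper: characters that a charset cannot represent
    are replaced with '?' (byte 0x3f) instead of raising an error.
    """
    rows = {}
    for label, codec in CODECS.items():
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("蟻μ[^").items():
        print(label, bits, hex_string)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.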