What bit string is a character (or a short run of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 艾??飮??柔??[艾??飮??柔??[^ 111001001000100000111111001111111001111101011010001111110011111110001111010111110011111100111111010110111110010010001000001111110011111110011111010110100011111100111111100011110101111100111111001111110101101101011110 e4883f3f9f5a3f3f8f5f3f3f5be4883f3f9f5a3f3f8f5f3f3f5b5e
EUC-JP 艾??飮??柔??[艾??飮??柔??[^ 111001111110100000111111001111111101110110111011001111110011111110111101110000000011111100111111010110111110011111101000001111110011111111011101101110110011111100111111101111011100000000111111001111110101101101011110 e7e83f3fddbb3f3fbdc03f3f5be7e83f3fddbb3f3fbdc03f3f5b5e
UTF-8 艾싳궪飮꿰눨柔곗뒾[艾싳궪飮꿰눨柔곗뒾[^ 111010001000100110111110111011001000101110110011111010101011011010101010111010011010001110101110111010101011111110110000111010111000100010101000111001101001111110010100111010101011001110010111111010111001001010111110010110111110100010001001101111101110110010001011101100111110101010110110101010101110100110100011101011101110101010111111101100001110101110001000101010001110011010011111100101001110101010110011100101111110101110010010101111100101101101011110 e889beec8bb3eab6aae9a3aeeabfb0eb88a8e69f94eab397eb92be5be889beec8bb3eab6aae9a3aeeabfb0eb88a8e69f94eab397eb92be5b5e
UHC 艾싳궪飮꿰눨柔곗뒾[艾싳궪飮꿰눨柔곗뒾[^ 111001001111010110011010111011001000001010111100111010111110011010110010111001111000011110111111111010101111010110110000111011001000101010110100010110111110010011110101100110101110110010000010101111001110101111100110101100101110011110000111101111111110101011110101101100001110110010001010101101000101101101011110 e4f59aec82bcebe6b2e787bfeaf5b0ec8ab45be4f59aec82bcebe6b2e787bfeaf5b0ec8ab45b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
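
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names on the right-hand side of CHARSETS are assumptions about which Python codecs correspond to the labels used here (cp932 for SJIS-WIN, cp949 for UHC), and the function names encode_row and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears consistent with the 3f bytes in the table above.

```python
# Map the page's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("A[^").items():
        print(label, bits, hexstr)
```

Calling convert() with any short string yields one (binary, hexadecimal) pair per charset, in the same order as the rows of the table.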