To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嗚????㏄幽??如??有??怨??????^ 1001101001101010001111110011111100111111001111111000011101110100100101110100100000111111001111111001010001000000001111110011111110010111010011000011111100111111100010011000010100111111001111110011111100111111001111110011111101011110 9a6a3f3f3f3f877497483f3f94403f3f974c3f3f89853f3f3f3f3f3f5e
EUC-JP 嗚?????幽??如??有??怨??????^ 11010011110010110011111100111111001111110011111100111111110011011010100100111111001111111100011110100001001111110011111111001101101011010011111100111111101100011110010100111111001111110011111100111111001111110011111101011110 d3cb3f3f3f3f3fcda93f3fc7a13f3fcdad3f3fb1e53f3f3f3f3f3f5e
UTF-8 嗚삠굥履뉛㏄幽뚣럹如싲뿥有붺몴怨명떅嶺뚋살굜^ 11100101100101111001101011101100100000101010000011101010101101011010010111101111101001111001111111101011100010011001101111100011100011111000010011100101101110011011110111101011100110101010001111101011100111111011100111100101101001101000001011101100100010111011001011101011101111111010010111100110100111001000100111101011101101101011101011101011101010101011010011100110100000001010100011101011101010101000010111101011100101101000010111101111101001101010101111101011100110101000101111101100100000101011010011101010101101011001110001011110 e5979aec82a0eab5a5efa79feb899be38f84e5b9bdeb9aa3eb9fb9e5a682ec8bb2ebbfa5e69c89ebb6baebaab4e680a8ebaa85eb9685efa6abeb9a8bec82b4eab59c5e
UHC 嗚삠굥履뉛㏄幽뚣럹如싲뿥有붺몴怨명떅嶺뚋살굜^ 111001111111000010111011111000111000001010001011111011001010101010000111111011111010011110100110111010101110101110001100111000111000111010011000111001011111110110011010111010111001011110100101111010101111001110010100111001111001000110011100111010101011001110111000111011011000101110011011111001111010110110001100110011101011101111101100100000101000010001011110 e7f0bbe3828becaa87efa7a6eaeb8ce38e98e5fd9aeb97a5eaf394e7919ceab3b8ed8b9be7ad8ccebbec82845e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
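
The conversion shown in the table can be approximated with a short Python sketch. It is not the page's actual implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to be what the table does.

```python
# Charset labels from the table mapped to Python codec names.
# These pairings are assumed equivalents, not confirmed by the page.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode `text` in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, input, binary, hexadecimal.
    sample = "嗚^"
    for label, (bits, hex_string) in convert(sample).items():
        print(label, sample, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.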