To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾??竊??宥????????循??畑??一?? 111001001000100000111111001111111110001010000110001111110011111110010111010001110011111100111111001111110011111100111111001111110011111100111111100011110111101000111111001111111001010010101000001111110011111110001000111010100011111100111111 e4883f3fe2863f3f97473f3f3f3f3f3f3f3f8f7a3f3f94a83f3f88ea3f3f
EUC-JP 艾??竊??宥????????循??畑??一?? 111001111110100000111111001111111110001111100110001111110011111111001101101010000011111100111111001111110011111100111111001111110011111100111111101111011101101100111111001111111100100010101010001111110011111110110000111011000011111100111111 e7e83f3fe3e63f3fcda83f3f3f3f3f3f3f3fbddb3f3fc8aa3f3fb0ec3f3f
UTF-8 艾쎈끏竊뽨틠宥닿쿅列룔깺劉졾쑵循뗰폊畑우쉯一뜻벚 111010001000100110111110111011001000111010001000111010111000000110001111111001111010101110001010111010111011110110101000111011011000101110100000111001011010111010100101111010111000101110111111111011001011111110000101111011111010011010011100111010111010001110010100111010101011100110111010111011111010011110000111111011001010000110111110111011001001000110110101111001011011111010101010111010111001011110110000111011011000111110001010111001111001010110010001111011001001101010110000111011001000100110101111111001001011100010000000111010111001110010111011111010111011001010011010 e889beec8e88eb818fe7ab8aebbda8ed8ba0e5aea5eb8bbfecbf85efa69ceba394eab9baefa787eca1beec91b5e5beaaeb97b0ed8f8ae79591ec9ab0ec89afe4b880eb9cbbebb29a
UHC 艾쎈끏竊뽨틠宥닿쿅列룔깺劉졾쑵循뗰폊畑우쉯一뜻벚 111001001111010110111101111010111000010110111111111011111011110010010110111001001011101010001100111010101110100110110100111010101011001010011010111001101110101010110111111000111000001110100110111010101110010110100000111001011011111010101010111000101110000010001011111011111011110010010101111011111010010110111111111011001001101010000111111011001110100110110110111001101011101010100010 e4f5bdeb85bfefbc96e4ba8ceae9b4eab29ae6eab7e383a6eae5a0e5beaae2e08befbc95efa5bfec9a87ece9b6e6baa2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)