To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??裕?┿筍щ?躍??逸??酉??雅 11100001100111110011111100111111100101110101010000111111100001001011100111100010101000011000010010001011001111111001011011110100001111110011111110001000111011010011111100111111100100111101000100111111001111111000100111101011 e19f3f3f97543f84b9e2a1848b3f96f43f3f88ed3f3f93d13f3f89eb
EUC-JP 癲??裕?┿筍щ?躍??逸??酉??雅 11100010101000010011111100111111110011011011010100111111101010001011101111100100101000111010011111101011001111111100110011110110001111110011111110110000111011110011111100111111110001101101001100111111001111111011001011101101 e2a13f3fcdb53fa8bbe4a3a7eb3fccf63f3fb0ef3f3fc6d33f3fb2ed
UTF-8 癲븍쵉裕낉┿筍щ츉躍녹뮋逸썸뮄酉귥뒛雅 1110011110011001101100101110101110111000100011011110110010110101100010011110100010100011100101011110101110000010100010011110001010010100101111111110011110101101100011011101000110001001111011001011100010001001111010001011101010001101111010111000010110111001111010111010111010001011111010011000000010111000111011001000110110111000111010111010111010000100111010011000010110001001111010101011011110100101111010111001001010011011111010011001101110000101 e799b2ebb88decb589e8a395eb8289e294bfe7ad8dd189ecb889e8ba8deb85b9ebae8be980b8ec8db8ebae84e98589eab7a5eb929be99b85
UHC 癲븍쵉裕낉┿筍щ츉躍녹뮋逸썸뮄酉귥뒛雅 1110111110100110101110101110101110101100100010111110101110101110100001011110111110100110101110111110001011101100101011001110101110101110100001011110010110111000101100111110110010010010100110011110110011101111101111011110011010010010100100111110101110110111100000101110110010001010100110001110010010111010 efa6baebac8bebae85efa6bbe2ecacebae85e5b8b3ec9299ecefbde69293ebb782ec8a98e4ba

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
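
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (in particular SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which resembles the `3f` bytes seen in the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("あ").items():
        print(label, bits, hexstr)
```

For example, under these assumptions `convert("あ")["UTF-8"]` gives the hex string `e38182`, while the ISO-8859-1 entry falls back to `3f` because Latin-1 has no code for that character.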