To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀??肄?┸儒??繞??唯??儒??癲??逾 10001000101000110011111100111111111000111110010100111111100001001011110110001110111100100011111100111111111000111000010100111111001111111001011101000010001111110011111110001110111100100011111100111111111000011001111100111111001111111110011110100101 88a33f3fe3e53f84bd8ef23f3fe3853f3f97423f3f8ef23f3fe19f3f3fe7a5
EUC-JP 哀??肄?┸儒??繞??唯??儒??癲??逾 10110000101001010011111100111111111001101110011100111111101010001011111110111100111101000011111100111111111001011110010100111111001111111100110110100011001111110011111110111100111101000011111100111111111000101010000100111111001111111110111010100111 b0a53f3fe6e73fa8bfbcf43f3fe5e53f3fcda33f3fbcf43f3fe2a13f3feea7
UTF-8 哀노끀肄됵┸儒삳쐨繞볥쑜唯졾맫儒띠퐠癲용끇逾 111001011001001110000000111010111000010110111000111010111000000110000000111010001000001010000100111010111001000010110101111000101001010010111000111001011000010010010010111011001000001010110011111011001001000010101000111001111011100110011110111010111011001110100101111011001001000110011100111001011001010010101111111011001010000110111110111010111010011110101011111001011000010010010010111010111001110110100000111011011001000010100000111001111001100110110010111011001001101010101001111010111000000110000111111010011000000010111110 e59380eb85b8eb8180e88284eb90b5e294b8e58492ec82b3ec90a8e7b99eebb3a5ec919ce594afeca1beeba7abe58492eb9da0ed90a0e799b2ec9aa9eb8187e980be
UHC 哀노끀肄됵┸儒삳쐨繞볥쑜唯졾맫儒띠퐠癲용끇逾 1110010011101110101100111110101110000101101101101110110010111101100010011110111110100110101111111110101011100011101110111110101110011100100011011110100110100100100100111110101110011100101110111110101011100110101000001110010110010000101100111110101011100011101101101110110010111101100010011110111110100110101111111110101110000101101110111110101110110101 e4eeb3eb85b6ecbd89efa6bfeae3bbeb9c8de9a493eb9cbbeae6a0e590b3eae3b6ecbd89efa6bfeb85bbebb5

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
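
The conversion shown in the table above can be approximated with a short Python sketch. It is not the code behind this page. The codec names are Python's own, and mapping "SJIS-WIN" to `cp932` and "UHC" to `cp949` is an assumption, so results may differ from the table for some characters. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the rows above.

```python
# Minimal sketch: encode a string in several charsets and show the
# resulting bytes as a binary string and a hexadecimal string.
# The label -> codec mapping below is an assumption, not taken from the page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("哀").items():
        print(label, bits, hex_string)
```

For the single character 哀, this yields e59380 in UTF-8, 88a3 in CP932 and b0a5 in EUC-JP, which agrees with the leading bytes of the corresponding rows in the table.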