To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 顫趣スソ鋺雁ョ夲スエ驛≪隰ウ讎奇スュ 111010001111101010001110111011111011110110111111111001111111101010001010111001011010111010011010111011111011110110110100111010011000001110000001111000011111001010110011111010001010110010110011111001101010011010001010111011111011110110101101 e8fa8eefbdbfe7fa8ae5ae9aefbdb4e98381e1f2b3e8acb3e6a68aefbdad
EUC-JP 顫趣スソ鋺雁ョ夲スエ驛≪?隰ウ讎奇スュ 11110000111111001011110011110001100011101011110110001110101111111110111011111100101101001110011110001110101011101101010011110001100011101011110110001110101101001111000111100011101000101110001100111111111100001010111010001110101100111110110010101000101101001111000110001110101111011000111010101101 f0fcbcf18ebd8ebfeefcb4e78eaed4f18ebd8eb4f1e3a2e33ff0ae8eb3eca8b4f18ebd8ead
UTF-8 顫趣スソ鋺雁ョ夲スエ驛≪隰ウ讎奇スュ 111010011010000110101011111010001011011010100011111011111011110110111101111011111011110110111111111010011000101110111010111010011001101110000001111011111011110110101110111001011010010010110010111011111011110110111101111011111011110110110100111010011010100110011011111000101000100110101010111011101000011110101010111010011001101010110000111011111011110110110011111010001010111010001110111001011010010110000111111011111011110110111101111011111011110110101101 e9a1abe8b6a3efbdbdefbdbfe98bbae99b81efbdaee5a4b2efbdbdefbdb4e9a99be289aaee87aae99ab0efbdb3e8ae8ee5a587efbdbdefbdad
UHC 顫趣???雁????驛≪????奇?? 11101111101101011111011010101100001111110011111100111111111001001101001000111111001111110011111100111111111001101011111010100001111011000011111100111111001111110011111111010000111101000011111100111111 efb5f6ac3f3f3fe4d23f3f3f3fe6bea1ec3f3f3f3fd0f43f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)