To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 歪??悠??惟щ?嚴щ????鷹??????? 100110000110001100111111001111111001011101001001001111110011111110001000110100101000010010001011001111111001101010001110100001001000101100111111001111110011111100111111100100011110100100111111001111110011111100111111001111110011111100111111 98633f3f97493f3f88d2848b3f9a8e848b3f3f3f3f91e93f3f3f3f3f3f3f
EUC-JP 歪??悠??惟щ?嚴щ?佾??鷹?????堉? 11001111110001000011111100111111110011011010101000111111001111111011000011010100101001111110101100111111110100111110111010100111111010110011111110001111101100001111101100111111001111111100001011101011001111110011111100111111001111110011111110001111101101111111110100111111 cfc43f3fcdaa3f3fb0d4a7eb3fd3eea7eb3f8fb0fb3f3fc2eb3f3f3f3f3f8fb7fd3f
UTF-8 歪뺤옓悠띷끽惟щ룆嚴щ뀡佾ⓨ뼦鷹녿뭽麗몃씈堉텲 11100110101011011010101011101011101110101010010011101100100110001001001111100110100000101010000011101011100111011011011111101011100000011011110111100110100000111001111111010001100010011110101110100011100001101110010110011010101101001101000110001001111010111000000010100001111001001011110110111110111000101001001110101000111010111011110010100110111010011011011110111001111010111000010110111111111010111010110110111101111011111010011010001000111010111010101010000011111011001001010010001000111001011010000010001001111011011000010110110010 e6adaaebbaa4ec9893e682a0eb9db7eb81bde6839fd189eba386e59ab4d189eb80a1e4bdbee293a8ebbca6e9b7b9eb85bfebadbdefa688ebaa83ec9488e5a089ed85b2
UHC 歪뺤옓悠띷끽惟щ룆嚴щ뀡佾ⓨ뼦鷹녿뭽麗몃씈堉텲 11101000111000001001010111101100100111101001100111101010111011011000110111100110101100111010001111101010111011101010110011101011100011111000010111100101111100011010110011101011100001011001100011101100111010111010100011100101100101101010100111101011111011011000011011101011100100101000110011100110101100001011100011101011100111011010000011101011101111001011011101000101 e8e095ec9e99eaed8de6b3a3eaeeaceb8f85e5f1aceb8598eceba8e596a9ebed86eb928ce6b0b8eb9da0ebbcb745

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)