To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN ?る?違э?怨??i?る?違э?怨??iB 0011111110000010111010010011111110001000111000011000010010001111001111111000100110000101001111110011111101101001001111111000001011101001001111111000100011100001100001001000111100111111100010011000010100111111001111110110100101000010 3f82e93f88e1848f3f89853f3f693f82e93f88e1848f3f89853f3f6942
EUC-JP ?る?違э?怨??i?る?違э?怨??iB 0011111110100100111010110011111110110000111000111010011111101111001111111011000111100101001111110011111101101001001111111010010011101011001111111011000011100011101001111110111100111111101100011110010100111111001111110110100101000010 3fa4eb3fb0e3a7ef3fb1e53f3f693fa4eb3fb0e3a7ef3fb1e53f3f6942
UTF-8 閭る틶違э쭓怨뺤젘i閭る틶違э쭓怨뺤젘iB 11101111101001101000011011100011100000101000101111101101100010111011011011101001100000011001010111010001100011011110110010101101100100111110011010000000101010001110101110111010101001001110110010100000100110000110100111101111101001101000011011100011100000101000101111101101100010111011011011101001100000011001010111010001100011011110110010101101100100111110011010000000101010001110101110111010101001001110110010100000100110000110100101000010 efa686e3828bed8bb6e98195d18decad93e680a8ebbaa4eca09869efa686e3828bed8bb6e98195d18decad93e680a8ebbaa4eca0986942
UHC 閭る틶違э쭓怨뺤젘i閭る틶違э쭓怨뺤젘iB 111001101010110110101010111010111011101010011101111010101101111010101100111011111010011110001011111010101011001110010101111011001010000010010100011010011110011010101101101010101110101110111010100111011110101011011110101011001110111110100111100010111110101010110011100101011110110010100000100101000110100101000010 e6adaaebba9deadeacefa78beab395eca09469e6adaaebba9deadeacefa78beab395eca0946942

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
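
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the hypothetical helper name encode_all. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("iB").items():
        print(label, bits, hex_string)
```

For the input "iB", every charset listed here yields the same two bytes (hex 6942), because all of them encode ASCII characters identically. The results differ only for non-ASCII input.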