To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 夜??誼??音??筌??苑ょ?矣??嚴щ?誼 1001011011101001001111110011111110001011011000100011111100111111100010011011100100111111001111111110001010100011001111110011111110001001100100011000001011100101001111111110000111100001001111110011111110011010100011101000010010001011001111111000101101100010 96e93f3f8b623f3f89b93f3fe2a33f3f899182e53fe1e13f3f9a8e848b3f8b62
EUC-JP 夜??誼??音??筌??苑ょ?矣??嚴щ?誼 1100110011101011001111110011111110110101110000110011111100111111101100101011101100111111001111111110010010100101001111110011111110110001111100011010010011100111001111111110001011100011001111110011111111010011111011101010011111101011001111111011010111000011 cceb3f3fb5c33f3fb2bb3f3fe4a53f3fb1f1a4e73fe2e33f3fd3eea7eb3fb5c3
UTF-8 夜쏅뗀誼⑼쬂音곗졑筌뚮뿫苑ょ솾矣뤿뼂嚴щ뎽誼 1110010110100100100111001110110010001111100001011110101110010111100000001110100010101010101111001110001010010001101111001110110010101100100000101110100110011111101100111110101010110011100101111110110010100001100100011110011110101101100011001110101110011010101011101110101110111111101010111110100010001011100100011110001110000010100001111110110010000110101111101110011110011111101000111110101110100100101111111110101110111100100000101110010110011010101101001101000110001001111010111000111010111101111010001010101010111100 e5a49cec8f85eb9780e8aabce291bcecac82e99fb3eab397eca191e7ad8ceb9aaeebbfabe88b91e38287ec86bee79fa3eba4bfebbc82e59ab4d189eb8ebde8aabc
UHC 夜쏅뗀誼⑼쬂音곗졑筌뚮뿫苑ょ솾矣뤿뼂嚴щ뎽誼 1110010110101000100110111110101110110110101111101110101111111110101010011110111110100110100110011110101111100101101100001110110010100000101111101110111110100111100011001110101110010111101010111110101010111101101010101110011110011001101100101110101111111000100011111110101110010110100011001110010111110001101011001110101110001001100100001110101111111110 e5a89bebb6beebfea9efa699ebe5b0eca0beefa78ceb97abeabdaae799b2ebf88feb968ce5f1aceb8990ebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)