What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鈺??艤①????娃??鈺??艤①????汚??^ 111110111100010000111111001111111110010001111110100001110100000000111111001111110011111100111111100010001010000100111111001111111111101111000100001111110011111111100100011111101000011101000000001111110011111100111111001111111000100110011000001111110011111101011110 fbc43f3fe47e87403f3f3f3f88a13f3ffbc43f3fe47e87403f3f3f3f89983f3f5e
EUC-JP 鈺??艤??洧??娃??鈺??艤??洧??汚??^ 10001111111000111101010100111111001111111110011111011111001111110011111110001111110001111011010000111111001111111011000010100011001111110011111110001111111000111101010100111111001111111110011111011111001111110011111110001111110001111011010000111111001111111011000111111000001111110011111101011110 8fe3d53f3fe7df3f3f8fc7b43f3fb0a33f3f8fe3d53f3fe7df3f3f8fc7b43f3fb1f83f3f5e
UTF-8 鈺됱눃艤①륫洧뺝벧娃뉖젇鈺됱눃艤①륫洧뺝벧汚삳젛^ 11101001100010001011101011101011100100001011000111101011100010001000001111101000100010011010010011100010100100011010000011101011101001011010101111100110101101001010011111101011101110101001110111101011101100101010011111100101101010001000001111101011100010011001011011101100101000001000011111101001100010001011101011101011100100001011000111101011100010001000001111101000100010011010010011100010100100011010000011101011101001011010101111100110101101001010011111101011101110101001110111101011101100101010011111100110101100011001101011101100100000101011001111101100101000001001101101011110 e988baeb90b1eb8883e889a4e291a0eba5abe6b4a7ebba9debb2a7e5a883eb8996eca087e988baeb90b1eb8883e889a4e291a0eba5abe6b4a7ebba9debb2a7e6b19aec82b3eca09b5e
UHC 鈺됱눃艤①륫洧뺝벧娃뉖젇鈺됱눃艤①륫洧뺝벧汚삳젛^ 11101000101011011000100111101100100001111010010011101011111110101010100011100111101110001010000111101010111110111001010111100101101110101010011011101000110111111000011111101011101000001000101011101000101011011000100111101100100001111010010011101011111110101010100011100111101110001010000111101010111110111001010111100101101110101010011011100111111111011011101111101011101000001001011101011110 e8ad89ec87a4ebfaa8e7b8a1eafb95e5baa6e8df87eba08ae8ad89ec87a4ebfaa8e7b8a1eafb95e5baa6e7fdbbeba0975e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
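
A conversion like the one in the table can be approximated outside this page. The following is a minimal Python sketch, not the code behind this tool. The function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which mirrors the runs of 3f in the table, though the exact output for a given character may differ from this tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ^"  # one Japanese character followed by an ASCII caret
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("^", "utf-8")` returns `("01011110", "5e")`, matching the trailing `5e` at the end of every row in the table.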