To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 誤??韋??循??冗??宥??攸??魚??搖 100011001110101100111111001111111110100011101000001111110011111110001111011110100011111100111111100011111110011100111111001111111001011101000111001111110011111110011101101111110011111100111111100010111001101100111111001111111001110110001010 8ceb3f3fe8e83f3f8f7a3f3f8fe73f3f97473f3f9dbf3f3f8b9b3f3f9d8a
EUC-JP 誤??韋??循??冗??宥??攸??魚??搖 101110001110110100111111001111111111000011101010001111110011111110111101110110110011111100111111101111101110100100111111001111111100110110101000001111110011111111011010110000010011111100111111101101011111101100111111001111111101100111101010 b8ed3f3ff0ea3f3fbddb3f3fbee93f3fcda83f3fdac13f3fb5fb3f3fd9ea
UTF-8 誤곸룆韋귝쨫循녿겱冗밴낱宥욅춯攸꾪뜑魚좏뒞搖 111010001010101010100100111010101011001110111000111010111010001110000110111010011001111110001011111010101011011110011101111011001010100010101011111001011011111010101010111010111000010110111111111010101011001010110001111001011000011010010111111010111011000010110100111010111000001010110001111001011010111010100101111011001001101010000101111011001011011010101111111001101001010010111000111010101011111010101010111010111001110010010001111010011010110110011010111011001010001010001111111010111001001010011110111001101001000010010110 e8aaa4eab3b8eba386e99f8beab79deca8abe5beaaeb85bfeab2b1e58697ebb0b4eb82b1e5aea5ec9a85ecb6afe694b8eabeaaeb9c91e9ad9aeca28feb929ee69096
UHC 誤곸룆韋귝쨫循녿겱冗밴낱宥욅춯攸꾪뜑魚좏뒞搖 1110100010100110100000011110110010001111100001011110101011011111100000101110011010100100100001011110001011100000100001101110101110000001101111011110100110110111101110011110101010110011101110011110101011101001100111101110011110101101100011001110101011110010100001001110110110001101100101001110010111100000101000001110110110001010100110101110100011110100 e8a681ec8f85eadf82e6a485e2e086eb81bde9b7b9eab3b9eae99ee7ad8ceaf284ed8d94e5e0a0ed8a9ae8f4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)