To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 魚??徇??慂ъ?v魚??徇??慂ъ?vB 1000101110011011001111110011111110011100011011010011111100111111100111001100100010000100100011000011111101110110100010111001101100111111001111111001110001101101001111110011111110011100110010001000010010001100001111110111011001000010 8b9b3f3f9c6d3f3f9cc8848c3f768b9b3f3f9c6d3f3f9cc8848c3f7642
EUC-JP 魚??徇??慂ъ?v魚??徇??慂ъ?vB 1011010111111011001111110011111111010111110011100011111100111111110110001100101010100111111011000011111101110110101101011111101100111111001111111101011111001110001111110011111111011000110010101010011111101100001111110111011001000010 b5fb3f3fd7ce3f3fd8caa7ec3f76b5fb3f3fd7ce3f3fd8caa7ec3f7642
UTF-8 魚뚦땭徇먮젧慂ъ쇊v魚뚦땭徇먮젧慂ъ쇊vB 11101001101011011001101011101011100110101010011011101011100101011010110111100101101111101000011111101011101010001010111011101100101000001010011111100110100001011000001011010001100010101110110010000111100010100111011011101001101011011001101011101011100110101010011011101011100101011010110111100101101111101000011111101011101010001010111011101100101000001010011111100110100001011000001011010001100010101110110010000111100010100111011001000010 e9ad9aeb9aa6eb95ade5be87eba8aeeca0a7e68582d18aec878a76e9ad9aeb9aa6eb95ade5be87eba8aeeca0a7e68582d18aec878a7642
UHC 魚뚦땭徇먮젧慂ъ쇊v魚뚦땭徇먮젧慂ъ쇊vB 111001011110000010001100111001011000101110000011111000101101111110010000111010111010000010011111111010011011110110101100111011001001100110111100011101101110010111100000100011001110010110001011100000111110001011011111100100001110101110100000100111111110100110111101101011001110110010011001101111000111011001000010 e5e08ce58b83e2df90eba09fe9bdacec99bc76e5e08ce58b83e2df90eba09fe9bdacec99bc7642

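SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f), as in the ISO-8859-1 row above.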
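
The table above can be reproduced approximately with a short Python sketch. It is not this page's own implementation: the function name encode_to_bits is hypothetical, and the mapping of the charset labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (3f), which matches what the ISO-8859-1 row shows.

```python
# Minimal sketch: encode a string in several charsets and show the
# resulting bytes as a binary string and as a hexadecimal string.

# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns each unencodable character into b"?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "\u9b5aB"  # the kanji U+9B5A (fish) followed by ASCII "B"
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits(sample, codec)
        print(label, bits, hex_string)
```

For the sample string, the UTF-8 result is e9ad9a42, and the SJIS-WIN and EUC-JP results begin with 8b9b and b5fb, the same leading bytes as in the corresponding rows of the table.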