To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ???泣ф?乙μ?v???泣ф?乙μ?vB 0011111100111111001111111000101110000011100001001000011000111111100010011011001110000011110010100011111101110110001111110011111100111111100010111000001110000100100001100011111110001001101100111000001111001010001111110111011001000010 3f3f3f8b8384863f89b383ca3f763f3f3f8b8384863f89b383ca3f7642
EUC-JP ???泣фˇ乙μ?v???泣фˇ乙μ?vB 001111110011111100111111101101011110001110100111111001101000111110100010101100001011001010110101101001101100110000111111011101100011111100111111001111111011010111100011101001111110011010001111101000101011000010110010101101011010011011001100001111110111011001000010 3f3f3fb5e3a7e68fa2b0b2b5a6cc3f763f3f3fb5e3a7e68fa2b0b2b5a6cc3f7642
UTF-8 黎싳뼲泣фˇ乙μ죦v黎싳뼲泣фˇ乙μ죦vB 111011111010011010001001111011001000101110110011111010111011110010110010111001101011001110100011110100011000010011001011100001111110010010111001100110011100111010111100111011001010001110100110011101101110111110100110100010011110110010001011101100111110101110111100101100101110011010110011101000111101000110000100110010111000011111100100101110011001100111001110101111001110110010100011101001100111011001000010 efa689ec8bb3ebbcb2e6b3a3d184cb87e4b999cebceca3a676efa689ec8bb3ebbcb2e6b3a3d184cb87e4b999cebceca3a67642
UHC 黎싳뼲泣фˇ乙μ죦v黎싳뼲泣фˇ乙μ죦vB 111001101011000110011010111011001001011010110101111010111110100010101100111001101010001010100111111010111110000010100101111011001010000110000001011101101110011010110001100110101110110010010110101101011110101111101000101011001110011010100010101001111110101111100000101001011110110010100001100000010111011001000010 e6b19aec96b5ebe8ace6a2a7ebe0a5eca18176e6b19aec96b5ebe8ace6a2a7ebe0a5eca1817642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)