What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 阿??泣??濡μ?v阿??泣??濡μ?vB 1000100010100010001111110011111110001011100000110011111100111111100101000100011110000011110010100011111101110110100010001010001000111111001111111000101110000011001111110011111110010100010001111000001111001010001111110111011001000010 88a23f3f8b833f3f944783ca3f7688a23f3f8b833f3f944783ca3f7642
EUC-JP 阿??泣?ł濡μ?v阿??泣?ł濡μ?vB 101100001010010000111111001111111011010111100011001111111000111110101001110010001100011110101000101001101100110000111111011101101011000010100100001111110011111110110101111000110011111110001111101010011100100011000111101010001010011011001100001111110111011001000010 b0a43f3fb5e33f8fa9c8c7a8a6cc3f76b0a43f3fb5e33f8fa9c8c7a8a6cc3f7642
UTF-8 阿잛빢泣먪ł濡μ럨v阿잛빢泣먪ł濡μ럨vB 1110100110011000101111111110110010011110100110111110101110111001101000101110011010110011101000111110101110101000101010101100010110000010111001101011111110100001110011101011110011101011100111111010100001110110111010011001100010111111111011001001111010011011111010111011100110100010111001101011001110100011111010111010100010101010110001011000001011100110101111111010000111001110101111001110101110011111101010000111011001000010 e998bfec9e9bebb9a2e6b3a3eba8aac582e6bfa1cebceb9fa876e998bfec9e9bebb9a2e6b3a3eba8aac582e6bfa1cebceb9fa87642
UHC 阿잛빢泣먪ł濡μ럨v阿잛빢泣먪ł濡μ럨vB 111001001011100110011111111011001001010110111110111010111110100010010000111001111010100110101001111010111010000110100101111011001000111010001011011101101110010010111001100111111110110010010101101111101110101111101000100100001110011110101001101010011110101110100001101001011110110010001110100010110111011001000010 e4b99fec95beebe890e7a9a9eba1a5ec8e8b76e4b99fec95beebe890e7a9a9eba1a5ec8e8b7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
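
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the code behind this page. The helper name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
def encode_to_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits("阿B", codec)
        print(label, binary, hexadecimal)
```

For the input "阿B", the UTF-8 row comes out as e998bf42 and the ISO-8859-1 row as 3f42, because 阿 has no ISO-8859-1 representation.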