To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥〓?誼??巡?2筌??二????2筌??二ф? 1001101010001011100000011010110000111111100010110110001000111111001111111000111110000100001111111000001001010001111000101010001100111111001111111001001111110001001111110011111100111111001111111000001001010001111000101010001100111111001111111001001111110001100001001000011000111111 9a8b81ac3f8b623f3f8f843f8251e2a33f3f93f13f3f3f3f8251e2a33f3f93f184863f
EUC-JP 嚥〓?誼??巡?2筌??二??洧?2筌??二ф? 11010011111010111010001010101110001111111011010111000011001111110011111110111101111001000011111110100011101100101110010010100101001111110011111111000110111100110011111100111111100011111100011110110100001111111010001110110010111001001010010100111111001111111100011011110011101001111110011000111111 d3eba2ae3fb5c33f3fbde43fa3b2e4a53f3fc6f33f3f8fc7b43fa3b2e4a53f3fc6f3a7e63f
UTF-8 嚥〓씭誼뚪쉬巡볦2筌뗫뗀二뀐쭓洧얠2筌뗫봾二ф에 1110010110011010101001011110001110000000100100111110110010010100101011011110100010101010101111001110101110011010101010101110110010001001101011001110010110110111101000011110101110110011101001101110111110111100100100101110011110101101100011001110101110010111101010111110101110010111100000001110010010111010100011001110101110000000100100001110110010101101100100111110011010110100101001111110110010010110101000001110111110111100100100101110011110101101100011001110101110010111101010111110101110110100101111101110010010111010100011001101000110000100111011001001011110010000 e59aa5e38093ec94ade8aabceb9aaaec89ace5b7a1ebb3a6efbc92e7ad8ceb97abeb9780e4ba8ceb8090ecad93e6b4a7ec96a0efbc92e7ad8ceb97abebb4bee4ba8cd184ec9790
UHC 嚥〓씭誼뚪쉬巡볦2筌뗫뗀二뀐쭓洧얠2筌뗫봾二ф에 111001101011111110100001111010111001110110111110111010111111111010001100111010011011110110101100111000101101111010010011111011001010001110110010111011111010011110001011111010111011011010111110111011001010001110110010111011111010011110001011111010101111101110111110111011001010001110110010111011111010011110001011111010111001010010000101111011001010001110101100111001101011111110100001 e6bfa1eb9dbeebfe8ce9bdace2de93eca3b2efa78bebb6beeca3b2efa78beafbbeeca3b2efa78beb9485eca3ace6bfa1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)