To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??宋??齬??竊??儀??億??乳 0011111100111111001111111000101110000011001111110011111110010001011101100011111100111111111010101001011100111111001111111110001010000110001111110011111110001011010101100011111100111111100010011010110100111111001111111001001111111011 3f3f3f8b833f3f91763f3fea973f3fe2863f3f8b563f3f89ad3f3f93fb
EUC-JP ???泣??宋??齬??竊??儀??億??乳 0011111100111111001111111011010111100011001111110011111111000001110101110011111100111111111100111111011100111111001111111110001111100110001111110011111110110101101101110011111100111111101100101010111100111111001111111100011011111101 3f3f3fb5e33f3fc1d73f3ff3f73f3fe3e63f3fb5b73f3fb2af3f3fc6fd
UTF-8 捻뀁늿泣쒏뉘宋믩겱齬잆룂竊먲쬉儀뺢턂億됰슗乳 111011111010011010100100111010111000000010000001111010111000101010111111111001101011001110100011111011001001001010001111111010111000100110011000111001011010111010001011111010111010111110101001111010101011001010110001111010011011110110101100111011001001111010000110111010111010001110000010111001111010101110001010111010111010100010110010111011001010110010001001111001011000010010000000111010111011101010100010111011011000010010000010111001011000010010000100111010111001000010110000111011001000101010010111111001001011100110110011 efa6a4eb8081eb8abfe6b3a3ec928feb8998e5ae8bebafa9eab2b1e9bdacec9e86eba382e7ab8aeba8b2ecac89e58480ebbaa2ed8482e58484eb90b0ec8a97e4b9b3
UHC 捻뀁늿泣쒏뉘宋믩겱齬잆룂竊먲쬉儀뺢턂億됰슗乳 1110011011110111101100101110110010001000100010001110101111101000100111001110011010110100101101011110000111100100100100101110101110000001101111011110010111100001100111111110001110001111100000111110111110111100100100001110111110100110100111111110101111110000100101011110101010110101100111101110010111100010100010011110101110011010101001101110101011100001 e6f7b2ec8888ebe89ce6b4b5e1e492eb81bde5e19fe38f83efbc90efa69febf095eab59ee5e289eb9aa6eae1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)