Which bit string is a character (or short string of characters) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀??遊????????魏ラ?擬?????唯 10001000101000110011111100111111100101110101011000111111001111110011111100111111001111110011111100111111001111111110100110110000100000111000100100111111100010110101101100111111001111110011111100111111001111111001011101000010 88a33f3f97563f3f3f3f3f3f3f3fe9b083893f8b5b3f3f3f3f3f9742
EUC-JP 哀??遊????????魏ラ?擬?????唯 10110000101001010011111100111111110011011011011100111111001111110011111100111111001111110011111100111111001111111111001010110010101001011110100100111111101101011011110000111111001111110011111100111111001111111100110110100011 b0a53f3fcdb73f3f3f3f3f3f3f3ff2b2a5e93fb5bc3f3f3f3f3fcda3
UTF-8 哀얜챶遊양댚硫깅닱列룸챷魏ラ뒽擬쒙폍連얇굠唯 111001011001001110000000111011001001011010011100111011001011000110110110111010011000000110001010111011001001011010010001111010111000110010011010111011111010011110001110111010101011100110000101111010111000101110110001111011111010011010011100111010111010001110111000111011001011000110110111111010011010110110001111111000111000001110101001111010111001001010111101111001101001001110101100111011001001001010011001111011011000111110001101111011111010011010011010111011001001011010000111111010101011010110100000111001011001010010101111 e59380ec969cecb1b6e9818aec9691eb8c9aefa78eeab985eb8bb1efa69ceba3b8ecb1b7e9ad8fe383a9eb92bde693acec9299ed8f8defa69aec9687eab5a0e594af
UHC 哀얜챶遊양댚硫깅닱列룸챷魏ラ뒽擬쒙폍連얇굠唯 1110010011101110101111101110101110101010100000111110101110110100101111101110011110001000101111101110101110101001101100011110101110001000101001111110011011101010101101111110101110101010100001001110101011100000101010111110100110001010101100111110101111110100100111001110111110111100100110001110011011100110101111101110001110000010100010001110101011100110 e4eebeebaa83ebb4bee788beeba9b1eb88a7e6eab7ebaa84eae0abe98ab3ebf49cefbc98e6e6bee38288eae6

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
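
The conversion shown in the table can be approximated with a few lines of Python. This is a minimal sketch, not the converter's actual implementation. The function name `to_bit_strings` and the `CODECS` mapping are hypothetical. The mapping assumes Python's `cp932` codec corresponds to SJIS-Win and `cp949` to UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of `3f` in the table.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Render each byte as exactly eight binary digits, then concatenate.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("哀", codec)
        print(label, binary, hexadecimal)
```

For the single character 哀, the UTF-8 row comes out as `111001011001001110000000` and `e59380`, which agrees with the first three bytes of the UTF-8 row in the table.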