To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?щ?冶?????冶?? 001111111000010010001011001111111001011011101000001111110011111100111111001111110011111110010110111010000011111100111111 3f848b3f96e83f3f3f3f3f96e83f3f
EUC-JP ?щ?冶?????冶?? 001111111010011111101011001111111100110011101010001111110011111100111111001111110011111111001100111010100011111100111111 3fa7eb3fccea3f3f3f3f3fccea3f3f
UTF-8 寧щ젚冶먰쓽溜싲젚冶먰벢 1110111110100110101010101101000110001001111011001010000010011010111001011000011010110110111010111010100010110000111011001001001110111101111011111010011110001011111011001000101110110010111011001010000010011010111001011000011010110110111010111010100010110000111010111011001010100010 efa6aad189eca09ae586b6eba8b0ec93bdefa78bec8bb2eca09ae586b6eba8b0ebb2a2
UHC 寧щ젚冶먰쓽溜싲젚冶먰벢 111001111010110010101100111010111010000010010110111001011010011110010000111011011001110110011000111010101111111010011010111010111010000010010110111001011010011110010000111011011001001110111011 e7acaceba096e5a790ed9d98eafe9aeba096e5a790ed93bb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)