To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN ???暎←?悅??}v???暎←?悅??}vB 0011111100111111001111111001110111110011100000011010100100111111111110101011110100111111001111110111110101110110001111110011111100111111100111011111001110000001101010010011111111111010101111010011111100111111011111010111011001000010 3f3f3f9df381a93ffabd3f3f7d763f3f3f9df381a93ffabd3f3f7d7642
EUC-JP ???暎←????}v???暎←????}vB 001111110011111100111111110110101111010110100010101010110011111100111111001111110011111101111101011101100011111100111111001111111101101011110101101000101010101100111111001111110011111100111111011111010111011001000010 3f3f3fdaf5a2ab3f3f3f3f7d763f3f3fdaf5a2ab3f3f3f3f7d7642
UTF-8 鍊꿰룉暎←뒼悅덄뜷}v鍊꿰룉暎←뒼悅덄뜷}vB 1110111110100110100110111110101010111111101100001110101110100011100010011110011010011010100011101110001010000110100100001110101110010010101111001110011010000010100001011110101110001101100001001110101110011100101101110111110101110110111011111010011010011011111010101011111110110000111010111010001110001001111001101001101010001110111000101000011010010000111010111001001010111100111001101000001010000101111010111000110110000100111010111001110010110111011111010111011001000010 efa69beabfb0eba389e69a8ee28690eb92bce68285eb8d84eb9cb77d76efa69beabfb0eba389e69a8ee28690eb92bce68285eb8d84eb9cb77d7642
UHC 鍊꿰룉暎←뒼悅덄뜷}v鍊꿰룉暎←뒼悅덄뜷}vB 1110011011101000101100101110011110001111100010001110011110110010101000011110011110001010101100101110011011101101100010001110011110001101101101010111110101110110111001101110100010110010111001111000111110001000111001111011001010100001111001111000101010110010111001101110110110001000111001111000110110110101011111010111011001000010 e6e8b2e78f88e7b2a1e78ab2e6ed88e78db57d76e6e8b2e78f88e7b2a1e78ab2e6ed88e78db57d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)