What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 俑??肉??碎??}v俑??肉??碎??}vB 1001100011011010001111110011111110010011111101110011111100111111111000011110101000111111001111110111110101110110100110001101101000111111001111111001001111110111001111110011111111100001111010100011111100111111011111010111011001000010 98da3f3f93f73f3fe1ea3f3f7d7698da3f3f93f73f3fe1ea3f3f7d7642
EUC-JP 俑??肉??碎??}v俑??肉??碎??}vB 1101000011011100001111110011111111000110111110010011111100111111111000101110110000111111001111110111110101110110110100001101110000111111001111111100011011111001001111110011111111100010111011000011111100111111011111010111011001000010 d0dc3f3fc6f93f3fe2ec3f3f7d76d0dc3f3fc6f93f3fe2ec3f3f7d7642
UTF-8 俑앸끁肉덂럳碎댄뀯}v俑앸끁肉덂럳碎댄뀯}vB 1110010010111111100100011110110010010101101110001110101110000001100000011110100010000010100010011110101110001101100000101110101110011111101100111110011110100010100011101110101110001100100001001110101110000000101011110111110101110110111001001011111110010001111011001001010110111000111010111000000110000001111010001000001010001001111010111000110110000010111010111001111110110011111001111010001010001110111010111000110010000100111010111000000010101111011111010111011001000010 e4bf91ec95b8eb8181e88289eb8d82eb9fb3e7a28eeb8c84eb80af7d76e4bf91ec95b8eb8181e88289eb8d82eb9fb3e7a28eeb8c84eb80af7d7642
UHC 俑앸끁肉덂럳碎댄뀯}v俑앸끁肉덂럳碎댄뀯}vB 1110100110110101100111011110101110000101101101111110101110111111100010001110010110001110100100111110000111101111101101001110110110000101101001010111110101110110111010011011010110011101111010111000010110110111111010111011111110001000111001011000111010010011111000011110111110110100111011011000010110100101011111010111011001000010 e9b59deb85b7ebbf88e58e93e1efb4ed85a57d76e9b59deb85b7ebbf88e58e93e1efb4ed85a57d7642

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
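
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) is an assumption, and the names CHARSETS, encode_row and convert are made up for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which resembles the 3f runs in the table above, although the exact substitution rules of this page may differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("}vB").items():
        print(label, bits, hex_string)
```

For a pure ASCII input such as "}vB", every charset listed here yields the same bytes (7d 76 42), which matches the tail of each row in the table. Differences only appear for non-ASCII characters.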