To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 押??唯??飮??}v押??唯??飮??}vB 1000100110011111001111110011111110010111010000100011111100111111100111110101101000111111001111110111110101110110100010011001111100111111001111111001011101000010001111110011111110011111010110100011111100111111011111010111011001000010 899f3f3f97423f3f9f5a3f3f7d76899f3f3f97423f3f9f5a3f3f7d7642
EUC-JP 押??唯??飮??}v押??唯??飮??}vB 1011001010100001001111110011111111001101101000110011111100111111110111011011101100111111001111110111110101110110101100101010000100111111001111111100110110100011001111110011111111011101101110110011111100111111011111010111011001000010 b2a13f3fcda33f3fddbb3f3f7d76b2a13f3fcda33f3fddbb3f3f7d7642
UTF-8 押꾨툋唯롥깷飮곴덩}v押꾨툋唯롥깷飮곴덩}vB 1110011010001010101111001110101010111110101010001110110110001000100010111110010110010100101011111110101110100001101001011110101010111001101101111110100110100011101011101110101010110011101101001110101110001101101010010111110101110110111001101000101010111100111010101011111010101000111011011000100010001011111001011001010010101111111010111010000110100101111010101011100110110111111010011010001110101110111010101011001110110100111010111000110110101001011111010111011001000010 e68abceabea8ed888be594afeba1a5eab9b7e9a3aeeab3b4eb8da97d76e68abceabea8ed888be594afeba1a5eab9b7e9a3aeeab3b4eb8da97d7642
UHC 押꾨툋唯롥깷飮곴덩}v押꾨툋唯롥깷飮곴덩}vB 1110010011100011100001001110101110111000100000111110101011100110100011101110010110000011101001011110101111100110100000011110101010110101101000100111110101110110111001001110001110000100111010111011100010000011111010101110011010001110111001011000001110100101111010111110011010000001111010101011010110100010011111010111011001000010 e4e384ebb883eae68ee583a5ebe681eab5a27d76e4e384ebb883eae68ee583a5ebe681eab5a27d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)