To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 塋よ?癌??倭??}v塋よ?癌??倭??}vB 10011010110010001000001011100110001111111000101011100000001111110011111110011000011000000011111100111111011111010111011010011010110010001000001011100110001111111000101011100000001111110011111110011000011000000011111100111111011111010111011001000010 9ac882e63f8ae03f3f98603f3f7d769ac882e63f8ae03f3f98603f3f7d7642
EUC-JP 塋よ?癌??倭??}v塋よ?癌??倭??}vB 11010100110010101010010011101000001111111011010011100010001111110011111111001111110000010011111100111111011111010111011011010100110010101010010011101000001111111011010011100010001111110011111111001111110000010011111100111111011111010111011001000010 d4caa4e83fb4e23f3fcfc13f3f7d76d4caa4e83fb4e23f3fcfc13f3f7d7642
UTF-8 塋よ쥤癌닷렘倭졽퍟}v塋よ쥤癌닷렘倭졽퍟}vB 1110010110100001100010111110001110000010100010001110110010100101101001001110011110011001100011001110101110001011101101111110101110100000100110001110010110000000101011011110110010100001101111011110110110001101100111110111110101110110111001011010000110001011111000111000001010001000111011001010010110100100111001111001100110001100111010111000101110110111111010111010000010011000111001011000000010101101111011001010000110111101111011011000110110011111011111010111011001000010 e5a18be38288eca5a4e7998ceb8bb7eba098e580adeca1bded8d9f7d76e5a18be38288eca5a4e7998ceb8bb7eba098e580adeca1bded8d9f7d7642
UHC 塋よ쥤癌닷렘倭졽퍟}v塋よ쥤癌닷렘倭졽퍟}vB 1110011110101011101010101110100010100010100101101110010011011111101101001110010110110111101111011110100011011110101000001110010010111011100101100111110101110110111001111010101110101010111010001010001010010110111001001101111110110100111001011011011110111101111010001101111010100000111001001011101110010110011111010111011001000010 e7abaae8a296e4dfb4e5b7bde8dea0e4bb967d76e7abaae8a296e4dfb4e5b7bde8dea0e4bb967d7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
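
The conversion shown in the table can be approximated with a short Python sketch. It is not this page's actual implementation. The function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to match the behavior visible in the ISO-8859-1 row.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note states
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are encoded as "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "}vB"  # the ASCII tail of the input shown in the table
    for label, codec in CHARSETS.items():
        binary, hexa = to_bit_strings(sample, codec)
        print(label, sample, binary, hexa)
```

Because "}", "v", and "B" are ASCII characters, every charset listed here encodes them as the same bytes (7d 76 42). Only the non-ASCII part of the input differs from row to row.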