To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????V}v?????????V}vB 00111111001111110011111100111111001111110011111100111111001111110011111101010110011111010111011000111111001111110011111100111111001111110011111100111111001111110011111101010110011111010111011001000010 3f3f3f3f3f3f3f3f3f567d763f3f3f3f3f3f3f3f3f567d7642
SJIS-WIN 役?????筍ろ?V}v役?????筍ろ?V}vB 10010110111100000011111100111111001111110011111100111111111000101010000110000010111010110011111101010110011111010111011010010110111100000011111100111111001111110011111100111111111000101010000110000010111010110011111101010110011111010111011001000010 96f03f3f3f3f3fe2a182eb3f567d7696f03f3f3f3f3fe2a182eb3f567d7642
EUC-JP 役?????筍ろ?V}v役?????筍ろ?V}vB 11001100111100100011111100111111001111110011111100111111111001001010001110100100111011010011111101010110011111010111011011001100111100100011111100111111001111110011111100111111111001001010001110100100111011010011111101010110011111010111011001000010 ccf23f3f3f3f3fe4a3a4ed3f567d76ccf23f3f3f3f3fe4a3a4ed3f567d7642
UTF-8 役대끇六쀨떏筍ろ렦V}v役대끇六쀨떏筍ろ렦V}vB 11100101101111011011100111101011100011001000000011101011100000011000011111101111101001111001000111101100100000001010100011101011100101101000111111100111101011011000110111100011100000101000110111101011101000001010011001010110011111010111011011100101101111011011100111101011100011001000000011101011100000011000011111101111101001111001000111101100100000001010100011101011100101101000111111100111101011011000110111100011100000101000110111101011101000001010011001010110011111010111011001000010 e5bdb9eb8c80eb8187efa791ec80a8eb968fe7ad8de3828deba0a6567d76e5bdb9eb8c80eb8187efa791ec80a8eb968fe7ad8de3828deba0a6567d7642
UHC 役대끇六쀨떏筍ろ렦V}v役대끇六쀨떏筍ろ렦V}vB 11100110101101011011010011101011100001011011101111101011101110111001011111101000100010111010010111100010111011001010101011101101100011101011010101010110011111010111011011100110101101011011010011101011100001011011101111101011101110111001011111101000100010111010010111100010111011001010101011101101100011101011010101010110011111010111011001000010 e6b5b4eb85bbebbb97e88ba5e2ecaaed8eb5567d76e6b5b4eb85bbebbb97e88ba5e2ecaaed8eb5567d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)